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ABSTRACT 

Linear and equipercentile equating conversions were 
developed for two forms of the Graduate Record Examinations (GRE) 
quantitative test and the verbal-plus-quantitative test. From a very 
large sample of students taking the GRE in October 1981, 
subpopulations were selected with respect to race, sex, field of 
study, and level of performance (defined by GRE analytical test 
score*). The variance error equated scores between the forms was 
calculated, and conversion* vere made between the test forms. 
Departures of the conversions, based on each of these special samples 
from the population conversion, were calculated and evaluated in 
terms of the standard error of equating at five selected raw score 
points on Form D3, adjusted for disparity between the means, as well 
as variance, skewness, and kurtosis for these populations and the 
corresponding total populations. The conversions for the Physical 
Science subpopulation were significantly different from the total 
population. When adjustments were made for variance, skewness, and 
kurtosis, these conversions fell in line for 'che homogeneous GRE 
quantitative test, but not so cler-ly for the heterogeneous 
verbal-plus-quantitative test. Conversions for all other 
subpopulations were acceptably within range. The assumption of 
population- independence for equating was supported for homogeneous 
but not heterogeneous tests. (Author/GDC) 
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An Examination of the Assumption That the Equating 
of Parallel Forms Is Population-Independent 

Abstract 

Linear and equipercentile conversions were developed relating Forms 3DGR1 
and 3DGR3 of the homogeneous GRE quantitative test and the specially 
constituted heterogeneous GRE verbal-plus-quantitative test, using randomly 
equivalent populations of about 13,500 cases takiag each form, drawn from the 
entire candidate group tested in the regular October 1981 administration of 
the Graduate Record Examinations. For purposes of this study these samples of 
13,500 cases were taken as representing their respective total populations, 
and the conversions based on them were taken as "population conversions." 
Empirical standard errors of equating were then developed for samples of 1,000 
by drawing 100 samples of that size from each of the two base populations and 
calculating the variance error of equated scores on Form Dl (i.e., 3DGR1) for 
each successive raw score on Form D3 (i.e., 3DGR2) and fitting a second-degree 
equation to those vaiiance errors. Samples of 1,000 cases taking each form 
were then selected at random from specially defined subpopulations homogeneous 
with respect to sex, race, field of study, and level of performance (the last 
of these defined by scores on the GRE analytical test) and used as the basis 
of additional conversions between the two test forms. 

Departures of the conversions based cn each of these specially selected 
samples from the population conversion were calculated and evaluated in terms 
of the standard error of equating at five selected raw score points on 



ERLC 



4 



-ii- 



Porm D3, adjusted for the disparity between the means, as well as the three 
higher moments — variance, skewness, and kurtosis — for these subpopulations 
and the corresponding moments of the total populations. 

The results of this phase of the study showed that the conversions for 
the samples drawn from the Physical Science subpopulation differed 
significantly from that of the total population. When, in the second phase 
of the study, the appropriate adjustments for the moments were made, it was 
found that the Physical Sciences conversions fell clearly in line for the 
homogeneous GRE quantitative test, but not quite so clearly for the 
heterogeneous GRE verbal-plus-quantitative test. The conversions for all 
other subpopulation samples were acceptably within range of the population 
conversion. It was concluded that the assumption of population-independence 
for equating is supportable foi homogeneous tests but, because of evidence of 
nonpar^llelism between the two forms, the assumption is not as clearly 
established by the data of this study for heterogeneous tests. 
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Although the theory and practice of test equating have always been of great 
interest to test developers, they did not attract wide interest in psychometrics 
until about 1970. Since then, coincident with the need to compile data derived 
from the administration of the tests of different publishers (in order, for 
example, to evaluate the effectiveness of various compensatory education 
programs) and with the growing interest in and understanding of the many 
applications of item response theory, including equating, articles and books 
describing and evaluating the various methods of equating have appeared in the 
psychometric literature with increasing frequency. During these past 15 years, 
there has been an active interest in conducting studies to compare several 
methods with respect to the kinds of variations in design and application that 
affect their stability and possible bias, with respect to their underlying 
assumptions and their robustness in the face of violations of these assumptions, 
and with respect to their usefulness in practical applications. In addition to 
these studies, at least one new method of test equating was developed during 
this period (Holland & Wightman, 1982), and several modifications have been 
offered as improvements on methods that were already available and in frequent 
use. 

In spite of this active and continuing interest, there still remains a 
long-standing assumption basic to all equating that, to this date, has not been 
examined in any detail. This is the assumption that the characteristics of the 
populations used for equating have no effect on the outcome of the equating. 
Indeed, reference to this assumption has been made in the past without question, 
as though it were established fact (see, for example, Angoff, 1966; Angoff, 
1984, p. 86). Clearly, the casual attitudes so often observed in selecting 
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samples for equating, and the conventional practice of applying equating 
results to the wide varieties of subpopulations suggests that there is no lack 
of confidence in the truth of the assumption. 

The purpose of this study is to test the validity of this assumption 
explicitly and to understand a little more clearly the meaning of score equating. 
The question addressed here is this: Within the variation expected from random 
factors, is the conversion function that is developed to convert the scores on 
Form x of a test to the scale of Form y — assuming that Forms x and y have been 
designed to be parallel forms — the very same function, or does it vary, depending 
on whether it has been developed using samples coming from one or another sub- 
population? To what extent is the conversion function population-independent, 
that is to say, person-free? And to state the obverse question: To what extent 
is it appropriate to apply a given conversion function to a particular individual 
or group, irrespective of the nature of the group? Once the two forms have been 
equated, to what extent is the equating function sufficiently general that we can 
us the equated scores on the two forms interchangeably for any and all types of 
individuals and groups? That is, to what extent is the equated measurement 
test-free? 

It is by now generally accepted that it makes no logical sense to equate 
two tests that are known to measure different traits. The "equating" of, say, a 
verbal test to a math test is not expected to yield the same result for different 
populations. Men, for example, score at about the same level on verbal tests as 
women do, but substantially higher on math tests. As a consequence, an equating 
operation for tests as different as these will yield predictably different 
conversion equations for the two sexes. The question then remains: Will the 
conversion be the same across populations if the tests are parallel? 

o 8 
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The statement of this question assumes that the matter of parallelism has 
been settled by means of some other criterion or some other set of procedures, 
for the similarity among the equating results for different subpoulations is 
Itself sometimes taken as an operational test of parallelism for the two tests 
in question. The characteristic of parallelism and the data resulting from the 
several equating operations are so intimately related that the characteristic is 
sometimes assumed to coerce the operational result — that is, population-free 
equating; and conversely, the operational iesult is sometimes taken as a test of 
the characteristic. The present study is intended co test what the writers still 
regard as a highlv reasonable, but as yet untested, hypothesis that the 
characteristic does indeed coerce the operational result, even for forms that 
were designed to be measuring the same set of traits and are therefore at least 
approximately parallel. 

The study was designed with the exclusive use, to the extent possible, of 
empirical data, freeing it (again, to the extent possible) of any assumptions 
that might rend to limit the generalizations to be drawn from the study. Accord- 
ingly, measures of random variation were developed entirely empirically, and 
adjustments, derived from theory, were made in these measures only as necessary 
to reflect differences in the first four moments for the groups studied. Actual- 
ly, the study was designed to investigate the assumption of population-f re3 
equating with both linear and equipercentile models. It was therefore understood 
that, for the study of equipercentile equating, an empirical measure of error had 
to be obtained; standard formulas do not exist for the varieties of procedures 
used to conduct equipercentile equating. One such formula does indeed exist 
(Lord, 1982), but expectedly, it applies only to a particular way of treating 
the data. Other ways of treating the data — and there are several — are not fully 
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accounced for by Lord's formula. 

By way of background for the procedures followed In the study, the 
following Information will be useful. At the October 1981 administration of 
the GRE General Test, three forms of three tests were administered: a 76-item 
verbal test, a 60-item quantitative test, and a 50-item analytical ability 
test. The analytical ability test was introduced in the fall of 1977 for 
experimental purposes, pending a final decision by the GRE Board regarding its 
retention as a formal part of the testing program. During this experimental 
period, it was planned to conduct studies of reliability, validity, bias, and 
coachability for the analytical test, and its possible redundancy with the 
verbal and quantitative tests. These studies were to be undertaken to determine 
whether the analytical test would stand up under the kind of scrutiny it was 
expected to receive. For the purposes of the present study, however, only two 
of these three tests were planned for specific use. One of these was the 
quantitative test, which was chosen as the principal test of interest because 
performance on it was expected to show more variability among variously selected 
subpopulations than performance on the verbal test. On the other hand, it was 
expected that results based on the quantitative test alone — a highly homogeneous 
test — might not be considered an adequate test of the assumption that equating 
is population-independent. What was needed was a heterogeneous test as well, on 
which different types of subpopulations wo 'Id show different patterns of 
response. Accordingly, a heterogeneous test was formed, constructed simply by 
adding the scores on the verbal and quantitative tests for each person tested. 
The study of the verbal-plus-quantitative test represented a second phase of the 
study, paralleling the study of the quantitative test in all respects. 
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Procedu re 

As already Indicated, the data used for this study came from rhe regular 
October 1981 administration of the Graduate Record Examinations General Test 
(previously known as the GRE Aptitude Test), at which time a total of about 
56,460 examinees were tested. In addition to other forms that were adminis- 
tered to special populations on or about the same testing date, the three 
principal test forms of the General Test were Forms 3DGR1, 3DGR2, and 3DGR3 ? 
which had all been developed at the same time with the same content and 
statistical specifications. These three test forms were distributed to the 
examinees at the administration in "spiralled" fashion. That is, the test 
books were packaged and distributed In such a way that every 1st, 4th, 7th, 
10th, ... student In each testing room received the same form; every 2nd, 5th, 
8th, 11th, ... student were also given the same form, but different Trom the 
form given the first group; and every 3rd, 6th, 9th, 12th, ... student were 
similarly given the same form, but different from the first two. For the 
purposes of the study, Forms 3BGR1 (hereafter referred to as Form Dl) and Form 
3DGR3 (to be referred to es Form D3) were selected f principally because they 
appeared to be the most similar of the three possible pairs of forms with 
respect to their statistical characteristics. From those students taking 
Forms Dl or D3, about 35,650 in total number, data for all students whose test 
supervisor (chief proctor) reported some testing irregularity were removed. 
Further, data for all students who failed to mark an answer on any one of the 
sections of the test were also removed. And, finally, data for all students 
who reported thac English was not their preferred language were removed. 
As a result of these restrictions 13,470 cases (49.89%) taking Form Dl and 
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13,527 cases (50, 11%) taking Form D3 were finally considered usable for the 
study. 

From each of the two base populations, 100 samples of 1,000 cases each 
were selected at random. (Each individual in a sample was selected without 
replacement; each sample was selected with replacement. ) Ttv; samples in each 
population were numbered as they were selected, from 1 to 100, and the samples 
in the two populations tearing the same numbe^ were paired, resulting in 100 
random pairings. 

Scores on Form D3 were then equated by both linear and equipercentile 
methods to scores on Form Dl, once for each of the 100 pairings, by procedures 
appropriate to Design I (random groups — one test administered to each group; 
see Angoff, 1984, p. 94), in which, for linear equating, scores on two forms 
are defined as equivalent if their standard-score deviates [(x - M x )/s x ] in 
their respective, randomly equivalent groups are equal. For equipercentile 
equating, scores on the two forms are defined as equivalent if their percen- 
tile ranks for the two tests, in the two randomly equivalent groups, are equal. 

As a result of these equating operations, a bundle of 100 linear conver- 
sions and a bundle of more complex functions resulting from the 100 equiper- 
centile equatings were produced separately for the quantitative score and for 
the verbal-plus-quantitative score. Actually, the equipercentile functions 
were very nearly linear for the main body of the data; as mentioned earlier, 
the two forms were designed to be approximately parallel in their statistical 
and content characteristics. 

In preparation for equipercentile equating, the observed frequencies in 
each of the 200 distributi ons of GRE quantitative score and of GRE verbal— plus— 
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quantitative score (100 for Form Dl and 100 for Form D3) we'*e first smoothed by 
a seven-point rolling average method developed by E. E. Cureton and J. W. Tukey 
(Cureton & Tukey, 1951; Angoff, 1984, page 12). Following this operation, the 
smoothed frequencies were smoothea a second time ard by the same procedure. 
(As a possible alternative procedure, the smoothing was also carried out by 
fitting the negative hypergeometric function to the raw data (Keats & Lord, 
1962). However, the negative hypergeometric, while preserving the mean and 
standard deviation, failed to give a satisfactory fit and was therefore not 
used in the study.) No smoothing undertaken for the succession of 
equipercentile points re Idling Form Dl and Form D3; all point* were connected 
by straight lines, and converted score points on Form D3 were read off by 
computer by linear interpolation. 

For the sake of convenience and easier interpretation, raw scores on Form 
Dl (the test of reference; throughout this study, raw scores on Form D3 were 
equated to raw scores on Form Dl) were converted, using all 13,470 cases 
available for that form, to a scale in which the mean for that group was 
redefined as 500 and the standard deviation as 100. As a result of this 
operation, it was possible to observe the random variation, empirically 
developed, of the 100 linear and 100 equipercenti i? equatings on a 500-100 
scale. Specifically, this variation is the variation in standard scores, the 
scores converted to the standard (500-100) scale for Form Dl, corresponding to 
each given raw score on Form D3. The equation converting raw scores on Form Dl 
to the standard reference scale is given as follows for the GRE quantitative 
score : 
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S . [ 1UU ] x + 500 - [-±^-][38.l86l] 
q L 9.1A26 Q L 9.1426 JL 1 

S - 10.9378 x + 82.3281, 

q q 

where denotes raw GRE quantitative scores on Form Dl and Sq denotes the 

reference scale for those scores. 

Slmll? '"ly , the equation converting raw scores on Form Dl to the reference 
scale Is given as follows for the GRE verbal-plus-quant 1 tat lve scores: 

S - 5.2439 x + 58.7683, 
c c 9 

where x c denotes raw GRE verbal-plus-quantitative scores on Form Dl and S c 

denotes the reference scale for those scores. 

Once the 1,000-case samples referred to above were drawn from each of the 
two base populations, 11 special subpopulations wete defined and formed, 
homogeneous with respect to sex, ethnic background, academic field, or score 
level, as follows: Men, Women; Whites, Minorities; Humanities Majors, Social 
Science Majors, Biological Science Majors, Physical Science Majors; and High-, 
Middle-, and Low-scoring examinees. Minorities were defined as including 
Blacks plus Hispanics (only). 

The categories of High-, Middle-, and Low-scoring were constituted by 
selecting the highest-scoring 27 percent, the middle-scoring 46 percent, and 
the lowest-scoring 27 percent on the GRE analytical test. This, the role of 
defining the High-, Middle-, and Low-scoring groups, was the only role in the 
study played b> •"he analytical test. When the cuts effected by the foregoing 
percentages Intersected a score interval (whic.i occurred in every instance), 
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the appropriate number in the interval was assigned to one category or the 
other at random. 

That the use of the analytical score was effective in making these 
assignments into the High-, Middle-, and Low-scoring groups may be inferred 
from Tables 1 and 2, which give the means, standard deviations, and inter- 
correlations among the verbal, quantitative, analytical, and verbal-plus- 
quantitative scores. The correlations of analytical with quantitative 
are .7149 in the Form Dl population and .7133 in the Form D3 population, 
high enough to use for selecting disparately scoring groups. The corre- 
lations of analytical with verbal-plus-quantitative are even higher: .7375 



Table 1 

Means, Standard Deviations, and Intercorrelations among 
Verbal, Quantitative, Analytical, and Verbal-Plus-Quantitative Scores 
for the Entire Population Taking Form Dl 

N - 13,470 



Verbal 

Quantative 

Analytical 

Verbal Plus 
Quantitative 



Verbal 

.556 

.6068 

.9172 



Quanti- 
tative 

.5560 



.7149 
.8412 



Analyt- 
ical 

.6068 

.7149 



.7375 



Verbal 

Plus 
Quant i- 
tative 

.9172 

.8412 

.7375 



Mean 
45.9558 
38.1861 
28.0589 
84.1419 



Standard 
Deviation 

12.4072 
9.1426 
7.8178 

19.0698 
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Table 2 

Means t Standard Deviations, and Intercorrelations among 
Verbal, Quantitative, Analytical and Verbal-Plus-Quautitative Scores 
for the Entire Population Taking Form D3 

N - 13,527 





Verbal 


Quanti- 
tative 


Analyt- 
ical 


Verbal 

Plus 
Quanti- 
tative 


Mean 


Standard 
Deviation 


Verbal 




.5900 


.6560 


.9074 


46.0686 


11.2228 


Quancative 


.5900 




.7133 


.8747 


38.6139 


9.7347 


Analytical 


.6560 


.7133 




.7651 


26.3650 


7.6019 


Verbal Plus 


.9074 


.8747 


.7651 




84.6825 


18.6982 



Quantitative 



in the Form Dl population and .7651 in the Form D3 population. It may be seen in 
Tables 3 and 4, in the rows corresponding to High-Scoring, Middle-Scoring and Low- 
Scaring, that the means and standard deviations of the resulting si'bpopulations 
selected on the basis of the analytical score are in fact quite different. 

Tables 1 and 2 also throw some light on the parallelism of Forms Dl and D3. 
Although the correlations of quantitative and analytical scores are remarkably 
similar (.7149 vs. .7133), the correlations of verbal scores with quantitative 
scores (.5560 vs. .5900) and verbal scores with analytical scores (.6068 vs. 
.6560) differ more than would be ideal for operationally parallel forms. This 
observation is supported by the fact that the standard deviations, especially for 
verbal scores (12.4072 vs. 11.2228), but also for quantitative scores (9.1426 vs. 
9.7347), differ more than would be expected if the forms were closely parallel. 
Further discussion of the lack ot parallelism of the two forms appears later in 
this report. 

16 
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The constitution of the Minority subpopulations is given in the following 



table: 



Numbers of Examinees Taking: 
Form Dl Form D3 



Blacks 

Puerto Ricans 
Mexican Americans 
Other Hispanic 



705 
33 
195 
111 
1,044 



698 
30 
163 
109 



1,000 



To make comparisons between the conversion functions developed from cases 
drawn from the 11 subpopulations with those developed from cases drawn from the 
total populatirn, samples of 1,000, equal in size to each of the 100 samples 
drawn from the total population, were drawn from the 11 subpopulations. In all, 
20 initial samples of size 1,000 were drawn from the subpopulations taking Form 
Dl, two from each of the subpopulations except for the White and Minority 
subpopulations, from which only one sample each was drawn. These exceptions were 
occasioned by the fact that the two Minority groups — the group taking Dl and the 
group taking D3 — were so small; the entire group of Minorities taking Form D3, 
for example, numbered only 1,000, indeed, exactly 1,000. To match that number, a 
sample of 1,000 was drawn from the total group of 1,044 taking Form Dl. (For 
symmetry's sake, only one sample, also of 1,000 cases, was drawn from the White 
subpopulation. ) Within each subpopulat ion the samples were numbered 1 or 2 in 
Che order in which they were drawn. 

Similarly, 20 samples of 1,000 cases each were drawn from the 
subpopulations taking Form D3 — again, two from each of the subpopulations 
except the White and the Minority groups, from whicn only one sample each was 
drawn. (As mentioned above, the size of the Minority subpopulation taking 
Form D3 was almost as small as that taking Form Dl, 1,044 cases, as compared 
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with 1,000.) As In the sampling from the total population, each individual in 
a subpopulation sample was selected without replacement; each sample was 
selected with replacement. Here, too, within each subpopulation the samples 
were numbered 1 or 2 In the order In which they were drawn. 

For reasons to be described below, two additional samples of 1,000 cases 
each, Samples 3 and 4, were drawn from each of the two Physical Science 
subpopulations to make a total of 22 pairs of samples drawn from particular 
subpopulations. 

Finally, within each subpopulation the sample of 1,000 case 8 taking Dl was 
paired with the sample of 1,000 cases taking D3 bearing the same number, and 
Form D3 was equated to Form Dl by both linear and equipercentile methods as 
described above. All these processes were carried out for the GRE quantitative 
scores and again for the GRE verbal-plus-quant itat I ve scores. The numbers of 
cases, means, standard deviations, and skewness and kurtosis values (normal 
kurtosis taken as zero) for GRE quantitative scores on Forms Dl and D3 are 
given in Table 3. Corresponding statistics for GRE verbal-plus-quantitat' e 
are given in Table 4. 

In both Tables 3 and 4, the sums of the numbers of cases in the sub- 
populations defined by the categories of sex, ethnicity, and field of study 
fall short of the numbers of cases in the total populations taking Forms Dl and 
D3. This is so becau.e in some instances examinees neglected to provide the 
information needed to classify them. In the case of the ethnic category, it is 
additionally so because in this study the Minority subpopulation was limited to 
Blacks, Puerto Ricans, Mexican Americans, and other Hispanics. 
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Table 3 



Summary Statistics on GRE Quantitative Scores 
for the Total Populations and Subpopulations 
Taking Forms Dl and D3 



r'orm Dl Form D3 





No. of 

Cases 


Mean 


Std Dev 


Sk 


Ku 


No. of 
Cases 


Mean 


Std Dev 


Sk 


Ku 


Total 


13,470 


38.19 


9.14 


-.2573 


-.1339 


13,527 


38.61 


9.73 


-.1858 


-.4194 


Male 


5,863 


41.14 


8.97 


-.4468 


.0303 


5,700 


42.00 


9.55 


-.4057 


-.3038 


Female 


7,468 


35.91 


8.58 


-.2405 


-.0242 


7,706 


36.14 


9.08 


-.1521 


-.3244 


White 


11,409 


38.94 


8.51 


-.1506 


-.1940 


11,533 


39.32 


9.18 


-.1151 


-.4574 


Minority 


1,044 


28.43 


9.63 


.2747 


-.3837 


1,000 


28.76 


9.96 


.4325 


-.2095 


Humanities 


1,923 


36.07 


8.76 


-.2699 


.0388 


1,935 


36.76 


8.83 


-.0385 


-.4004 


Social Science 


5,840 


35.56 


8.53 


-.2737 


.0828 


5,913 


35.62 


9.12 


-.1357 


-.2928 


Biological Science 


2,811 


38.67 


7.78 


-.3576 


.1624 


2,892 


39.36 


8.26 


-.3249 


-.0953 


Physical Science 


2,264 


46.65 


7.16 


-.7390 


.6885 


2,150 


48.29 


7.16 


-.8855 


.9950 


High-Scoring 


3,637 


46.00 


6.29 


-.1549 


-.5059 


3,652 


46.78 


6.92 


-.3513 


-.3420 


Mid-Scoring 


6,196 


38.48 


6.77 


-.0306 


-.0269 


6,223 


39.10 


7.32 


.0004 


-.2347 


Low-Scoring 


3,637 


29.88 


7.85 


-.0727 


-.0855 


3,652 


29.61 


7.98 


.1527 


-.1498 
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Table 4 



Summary Statistics on GRE Verbal-Plus-Quantitative Scores 
for the Total Populations and Subpopulations 
Taking Forms Dl and D3 



Form Dl Form D3 





No. of 
Cases 


Mean 


Std Dev 


Sk 


Ku 


No. of 
Cases 


Mean 


Std Dev 


Sk 


Ku 


Total 


13,470 


84.14 


19.07 


-.3140 


-.1648 


13,527 


84.68 


18.70 


-.2891 


-.1814 


Male 


5,863 


87.79 


18.66 


-.3903 


-.0114 


5,700 


89.48 


18.35 


-.4105 


-.0925 


Female 


7,468 


81.36 


18.85 


-.2703 


-.2205 


7,706 


8i.:.8 


18.14 


-.2553 


-.1447 


White 


11,409 


86.15 


17.46 


-.1847 


-.2783 


11,533 


86.45 


17.21 


-.1705 


-.2649 


Minority 


1 ,044 


61.32 


19.73 


.3557 


-.3343 


1,000 


63.43 


20.11 


.4869 


-.0870 


Humanities 


1,923 


85.71 


18.97 


-.4833 


-.0617 


1,935 


85.69 


18.31 


-.2076 


-.3131 


Social Science 


5,840 


79.99 


19.09 


-.2353 


-.2840 


5,913 


80.02 


18.62 


-.2135 


-.1977 


Biological Science 


2,811 


84.40 


16.98 


-.3002 


-.0327 


2,892 


85.19 


16.20 


-.3051 


.0337 


Phyflcal Science 


2,264 


94.46 


17.05 


-.4697 


.0979 


2,150 


97.40 


15.65 


-.6470 


.6977 


High-Scoring 


3,637 


100.81 


12.42 


-.1895 


-.2534 


3,652 


101.23 


12.22 


-.1704 


-.2153 


Mid-Scoring 


6,196 


84.99 


13.82 


-.1026 


-.1491 


6,223 


85.93 


13.01 


-.0145 


-.1876 


Low-Scl lng 


3,637 


66.04 


16.13 


-.0P08 


-.1659 


3,652 


66.01 


15. 23 


-.0426 


-.1007 
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As expected, Table 3, based on the quantitative scores, shows that the 
highest-scoring subpopulation, even higher than that defined as "High-Scoring," 
is the Physical Science group, whose means on both Forms Dl and D3 are more 
than a standard deviation higher than those of any of the other curricular 
groups. Of these GRE subpopulations , the lowest, on both Forms Dl and D3, is 
the Minority group, whose means are more than a standard deviation lower than 
the White group. 

Table 4, based on the verbal-plus-quantitative scores, fails to show 
Physical Scienc. at quite the level seen in Table 3. This is to be expected, 
since their superiority in the verbal urea is not nearly as pronounced as it 
is in the quantitative area, and the verbal component in the verbal-plus- 
quantitative score diminishes the superiority of the Physical Science group 
seen in Table 3. The Minority group, however, is still the lowest in Table 4 
on both Form Dl and Form D3. 

The differences among the subpopulations of Interest in this study have a 
direct bearing on the methods used for evaluating the differences in the 
equating functions, since the standard error of equating is a function of the 
level and dispersion (as well as the skewness and kurtosis) of the groups used 
for equating. The matter of the choice of standard error in this context 
deserves some elaboration. In the first place, there is no single standard 
error of equating; it varies as a function of score level, following what 
appears to be second-degree function of the general form, SE^* ■ a + bx + cx^ 
(where SE^* is the variance error of equated scores expressed on some 
scale, y, and x is the score that is converted, by means of the equating 
function, to the scale of y), showing a minimum in the general vicinity of the 
mean of the x-scores and becoming increasingly larger with increasing distance 
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from that minimum point. Secondly, there were three possible variance error 
functions to choose from i<\ evaluating the results of this study • One of 
these is a formula due to Lord (Lord, 1950; see also Angoff, 1984, page 97) 
that assumes normal distributions in the test forms and populations of 
interest, A second is ..nula due to Sraun and Holland (1982, p. 33) that 
is more general, allowing consideration of degrees of skewness and kurtosis, 
but ignoring moments higher than the fourth. The third function was the 
system of empirical variance errors observed simply by calculating the 
variance of converted scores — developed from the 100 conversion functions 
described above — on Form Dl corresponding to each raw score on Form D3. This 
operation produces a function that is expected to be — and, in fact, was — 
perfectly smooth for linear equating, but that had to be fitted for 
equipercentile equating. After an examination and consideration of these 
three types of functions, it was decided to use the last, a smooth function of 
the empirically determined values, at least for this phase of the analysis. 
There were two reasons for coming to this decision: (1) the three functions 
did not appear to differ greatly or systematically; (2) since the study was 
designed from the outset to be as nearly empirical as possible, and since *v> 
analytical function for the variance error of equating could be develop* < for 
equipercentile equating, it was decided, for consistency's sake as well, to 
depend on empirically developed variance error throughout — at least, to the 
extent possible. 

As indicated above, the empirical variance error function for equi- 
percentile equating had to be fitted, and this was done by a second-degree 
equation. The plot of observed variance error points for equipercentile 
equating, corresponding to successive raw score value on Form D3, was found 
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to be U-shaped, similar in form to that found for linear equating. However, 
these points were not only slightly erratic, as expected, they ceased to be 
monotonic in the regions near the extremes of the scale. Accordingly, the 
curves of equipercentile error variances, one for GRE quantitative scores 
and one for GRE verbal-plus-quantitative scores, were fitted only within the 
large central regions where the ascending bars of the U-curves displayed 
monotonicity. 

Four second-degree equations, expressing the variance errors of equating 
on the standard reference scale as a function of raw score on Form D3, 
resulted from the foregoing operations and are given as follows for linear 
and for equipercentile equating of the GRE quantitative scores and the GRE 
verbal-plus-quantitative scores: 

SE y*U) " 196 * 8437 " 8.8533x q + 0.1102x2 (i) 
SE y*(e) " 665.5842 - 33.1299x q + 0.4218x2 (2) 
SE£*(a) - 228.5855 - 4.6933x c + 0.0259x2 ( 3 ) 
SE £*(e) " 718.0959 - 15.3412x c + 0.0829x2 (4) 
As may be seen by comparing in detail equation (1) with (2) and equation (3) 
with (4), the variance errors of equipercentile equating for these data are, 
for the most part, considerably larger than the variance errors of linear 
equating. The largest differences occur at the extremes of the Form D3 raw 
score scale and decline in size as one approaches the general vicinity of the 
mean. At their minimums the differences in variance error between the two 
equating methods are very small. For a short interval in the region of the 
minimum, the variance error of equipercentile equating is actually slightly 
smaller than the variance error of linear equating. 
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Equations (l)-(4) are repeated in equations (5)-(8), this time expressed 
in terms of standardized deviations, z(x), from the maans on v orm D3 (x) for 
the total population (M x - 38.6139, s x « 9.7347; M x • 84.6825, s x * 18.6982). 



q q c c 

SE y*U) * 1^-3386 - 3.3148z(x) q + 10.4459z 2 (x) q (5) 

SE y*(e) s 15.1670 - 5.4351z(x) q + 39.9678z2(x) q (6) 

SE#* (Jl ) * 17.0949 - 5.6400z(x) c + 9.0657z2(x) c (7) 

SE y*(e) * 13.2385 - 24.4178z(x) c + 28.9732z 2 (x) c (8) 



Here, the observation made in the preceding paragraph, comparing linear and 
equipercentile equating errors, may be verified directly. At the mean, where 
z(x)« 0, the variance errors cf equipercentile equating are clearly smaller, 
but only by a small amount, than the variance errors of linear equating. 

There are additional matters to consider in the choice of the appropriate 
standard error. As alluded to above, the error of determining an equated 
score on Form y corresponding to a given scor* on Form x of a test is in large 
part a function of the standardized distance of that x-score from the mean of 
the distribution of x-scores. For example, the formula for the variance error 
of equating for Design I equating (Angoff, 1984, p. 94), assuming normal 
distributions for both Forms x and y, is shown by Lord (1950; see also Angoff, 
1984, p. 97) to be 

2s2 

SE y* " ^ (z x + 2), (9) 

where SE^* = the variance error of equated y-scores, 

nt = the sum of the numbers of cases in the two groups used 
for equating, and 

z x = (x - Mx)/s x . 

24 
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It can be determined from equation (9) that the standard error of eqtiting is 
1.22 times as large at one standard deviation from the mean as at the mean, 1.73 
times as large at two standard deviations from the mean as at the mean, and 2.35 
times as large at three standard deviations from the mean as at the mean. This 
being the case, any equating bas^d on a distribution of x scores with a mean 
other than that characterizing the total group taking Form x would yield 
converted scores with different standard errors from those appropriate to the 
total population. A low-scoring group, for example, would have smaller standard 
errors than the total group for scores near the lower end of the scale and 
larger errors near the upper end of the scale. A high-scoring group, on the 
other hand, would have larger standard errors than the total group for scores 
near the lower end of the scale and smaller errors near the upper end of the 
scale. Accordingly, to convert an equation for variance error that was based on 
the mean and standard deviation of one group, say Group 1, to an equation for 
variance error appropriate to another group, say Group 2, with a different mean 
and standard deviation, the mean and standard deviation implicit in the equation 
appropriate to Group 1, say SE^* - a + bxj + cx 2 , needs to be replaced by the 
mean and standard deviation of Group 2. In effect, this change calls for the 
substitutionion of the value, (s x ^/a x ^)(x2 " M x ^) + f for If the more 

general equation is used, of the form, SE^* » A + Bz(x) + Cz 2 (x), as in 
equations (5)-(8), then the expre sion (x -M x Vs x , containing the values of mean 
and standard deviation for the appropriate group, needs to be used in place 
of z(x). 

In addition to the specification of the first and second moments (mean 
and standard deviation), it is appropriate also to consider specifying the 
third and fourth moments if they differ from the third and fourth moments 
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that characterize the total group from which the 100 samples of 1,000 cases 
were drawn. As mentioned earlier, the formula for the variance error of 
equating given in equation (9) assumes normal distributions in x and y, in 
which skewness and kurtosis are taken as zero. A more general formula for 
the variance error of equating, which permits the specification of the third 
and fourth moments (but ignoring higher order terms), is given by Braun and 
Holland (1982, page 33) and reproduced here, with minor changes in notation, 
as equation (10): 

SE2* - -I {1 + Sk(x)z(x) + [ 2 + K"(*)l z 2(x)} 
s 2 

+ {1 + Sk(y)z(x) + l 2 -JtKuMi z 2 (x)}f (10) 
ny 

where 8y* ■ the variance of scores on Form Dl expressed 

on the 500-1(0 scale, 

n x and n y « the number of cases taking Forms D3 and Dl, 
respectively, 

Sk(x) and Sk(y) - the values of skewness in the D3 and Dl 

distributions, respectively, 

ku(x) and Ku(y) » the corresponding values of kurtosis, 

and z(x) « the standarized distance from the score of interest 
to its own mean. 

When n x =* n y * n, equation (10) reduces to equacion (11): 

SE 2* «_L {2 + [sk(x) + Sk(y)]z(x) + [4 ± Ku( *> ± Ku( y> ] z 2(x)} , (11) 

in which normal kurtosis is taken as zero. It may be seen that when the dis- 
tributions of x and y are both normal, equation (11) reduces to equation (9). 

With the foregoing types of data and formulas available, it is possible 
to make several types of comparisons. The present study was restricted, 
however, to answering the following questions: (1) To what extent do the 
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conversions based on the samples drawi* from each of the several subpopulations 
agree with the overall conversion based on the total population of all cases 
(N ■ 13,470 taking Form Dl; N • 13,527 taking Form D3)? To evaluate these 
differences, the variance errors of equating were taken from the empirical 
variances of converted scores on Form Dl observed at selected Form D3 raw 
scores — scores 10, 20, 30, AO, end 50 on the GRE quantitative test, and scores 
25, 50, 75, 100, and 125 on the GRE verbal-plus-quantitative t«st. These 
variance errors were taken as observed in the equating 8 based on the 100 pairs 
of samples drawn from the overall population shown in equations (l)-(4), 
unadjusted for the mean, standard deviation, skewness, or kurtosis of the 
particular subpopula-tions. To repeat, then: the question addressed in this 
series of test^ is: How well would an equating based on a general population 
serve members of special subpopulations whose means and standard deviations 
differed from the overalls 

On the other hand, the foregoing question (1) may not be the most appro- 
priate one to ask. One can argue that the standard errors based on the 100 
pairs of samples drawn from the total population are not appropriate for 
groups whose distribution moments, especially the mean and standard deviation, 
differ from those of the total population. Accordingly, we may ask: (2) How 
well does an equating for a subpopulation agree with the equating for the 
overall population if the variance errors for the latter wer<* adjusted to 
reflect the fact that the subpopulat Ion's first four moments differed from 
those of the overall group? 

Ordinarily, this second question night have been answered simply by 
substituting the observed values tor the first four moments in equation (10) 
using the resulting variance errors of equating to evaluate the disparity 
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between the subpopulation conversions and the overall conversion at various 
score points of inteiebw. However, this procedure was judged not to be 
entirely satisfactory. In the first place, it was intended that the study 
should depend more fully than this procedure would call for on the empirically 
observed variation of the 100 conversions based on samples drawn from the 
general population* Secondly, there was some concern that the observed values 
of skewness and kurtosis for the subpopulations were too unstable to use, based 
as they were in some instances on relatively small groups, without some attempt 
to improve them. Now it ;jas observed fhat, as expected, there was a substan- 
tial negative correlation between the mean of a subpopulation and the value of 
the subpopulat ion's skewness, a relationship that may generally be sensed from 
an observation of the values of mean and skewness shown in Tables 3 and 4* For 
GRE quantitative scores, Form Dl, this correlation was -*71; for GRE quantative 
scores, Form D3, the correlation was -.90; for GRE verbal-plus-quantitative 
scores, Form Dl , -70; and GRE verbal-plus-quantitative scores, Form D3, -*72* 
(These correlations were considerably affected, it should be noted, by the 
extreme bivariate points for the Minority group and the Physical Science groups. 
The former yielded consistently low means and relatively high positive values of 
skewness; the latter had very high means and relatively low ckewness values*) 
It was therefore possible to uoe an entire data set of bivariate points for the 
12 populations (11 subpopulations plus the total) to derive more reliable 
estimates of skewness (a relatively unreliable statistic) from knowledge of 
the subpopulation mean (a relatively stable statistic) by regressing the former 
on the latter* Accordingly, new regressed ei :imates of skewness were calculated 
and the factor, [sk(x) + Sk(y)] i/[ Sk(x) + Sk(y)] p , was formed (in which each of 



ERIC 




-23- 



the values of skewness is an estimated value, i represents one of the 11 
subpopulations and p represents the total population) and used to correct , 
that is, moderate, the coefficient of z{x) in the equations corresponding to 
equations (5) and (7). These equations, it is recalled, were empirically 
derived ftrom samples drawn at random from the overall population. 

The observed and estimated skewnesses, and the factors used to moderate 
the coefficients of z(x) in equations (5) and (7) are given in Tables 5 and 6. 
The foregoing describes the process of correcting for the fact that the 
skewnesses of each of the subpopulations differed from those of the total 
population and doing so by taking advantage of their predictability from 
knowledge of the means. Efforts to make stable estimates of kurtosis, how- 
ever, were not as successful, and the judgment was made that, in view of the 
instability of the ' rtosis values, no special efforts would be made to adjust 
them. Instead, the actual values of kurtosis that were observed for the 
entire populations of 13,h70 and 13,527 cases (taking Forms Dl and D3, respec- 
tively) were, vith one exception, used as they were found. In effect, this 
meant using the observed coefficients of z 2 (x) in equations (5) and (7) — as 
well as the constant values (A) — for each of the various subpopulations. The 
exception noted above was the case of the Physical Science subpopulation, 
whose kurtosis values were generally so extreme that, it was judged, they 
could not be ignored. Therefore, for thr Physical Science group, a correction 
factor was developed, similar in construction to that used for adjusting the 
empirical skewness coefficients: [ 10 + 2.5Ku(x) + 2.5Ku(y)] p8 /[lO + 2.5Ku(x) 
+ 2.5Ku(y)]p, where the subscript ps represents the Physical Science sub- 
population and p represents the total population. In a fashion similar to 
the adjustments for skewness, this factor was used to moderate the coefficient 
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Table 5 



Observed and Estimated Values of Skewness, 
and Correct! n Factors, by Subpopulation 

GRE Quantitative Test 



Subpopulation 



Values of 
Skewness-Form Dl 
Observed Regressed 



Values of 
Skewness-Fonn D3 
Observed Regressed 



Correction 
Factor 



Total 


-.2573 


-.2381 


-.1858 


-.1799 




Male 


-.4468 


-.3327 


-.4057 


-.3488 


1.6303 


Female 


-.2405 


-.1652 


-.1521 


-.0561 


0.5294 


White 


-.1506 


-.7621 


-.1151 


-.2151 


1.1415 


Iilnorlty 


.2747 


.0745 


.4325 


.3119 


-0.9243 


Humanities 


-.2699 


-.1705 


-.0385 


-.0875 


0.6172 


Social Science 


-.2737 


-.1541 


-.1357 


-.0305 


0.4416 


Biological Science 


-.3576 


-.2538 


-.3249 


-.2173 


1.1270 


Physical Science 


-.7390 


-.5091 


-.8855 


-.6631 


2.8042 


High-Scoring 


-.1549 


-.4884 


-.3513 


-.5877 


2.5742 


Middle-Scoring 


-.0306 


-.2475 


.0004 


-.2043 


1.0808 


Low-Scoring 


-.0727 


.0281 


.1527 


.2695 


-0.7118 
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Table 6 



Observed and Estimated Values of Skewness, 
and Correction Factors, by Subpopulation 

GRE Verbal-Plus-Quantitative Test 



Values of Values of 

Skewness-Form Dl Skevness-Form D3 Correction 

Subpopulation Observed Regressed Observed Regressed Factor 



Total 


-.3140 


-.2373 


-.2891 


-.2008 




Male 
Female 


-.3903 
-.2703 


-.2905 
-.1969 


-.4105 
-.2553 


-.2867 
-.1381 


1.3174 
0.7645 


White 
Minority 


-.1847 
.3557 


-.2666 
.0952 


-.1705 
.4869 


-.2325 
.1799 


1.1392 
-0.6279 


Humanities 
Social Science 
Biological Science 
Physical Science 


-.4833 
-.2353 
-.3002 
-.4697 


-.2601 
-.1768 
-.2411 
-.3876 


-.2076 
-.2135 
-.3051 
-.6470 


-.2188 
-.1173 
-.2098 
-.4287 


1.0931 
0.6713 
1.0293 
1.8632 


High-Scoring 

Middle-Scoring 

Low-Scoring 


-.1895 
-,1026 
-.0808 


-.4801 
-.2496 
.0265 


-.1704 
-.0145 
-.0426 


-.4972 
-.2232 
.1338 


2.2308 
1.0793 
-0.3659 
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of s 2 (x> in equations (5) and (7). As implied by the foregoing, these 
adjustments were made in the linear equations (only) for both GRE quantitative 
and GRE verbal-plus-quantitative scores. Because so little is known about the 
parameters for the variance error of equipercentile equating, no adjustments 
were made in equations (6) and (8) except to relocate the scores of interest 
with respect to their means and standard deviations. 

Table 7 gives the equations for the variance errors of linear equating 
for GRE quantitative scores for each of the several subpopulations, in both 
standard score and raw score form. As described above, these equations have 
been adjusted for the first four moments of the subpopulation distributions. 
Table 8 gives the equations for equipercentile equating, which are, also as 
described above, adjusted only for mean and standard deviation. Tables 9 and 
10 give the corresponding equations for GRE verbal-plus-quantitative scores. 
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Table 7 



Equations for Empirical Variance Errors of Linear Equating, 
Adjusted for the First Four Moments of the Distributions for the 
Subpopulations of Interest 







GRE Quantitative 










SB 2 ,* - 


A + Bz(x) 

0 


+ Cz2(x) q 


se|* 


- a + bx q 


+ cx2 


Subpopulation 


A 


B 


C 


a 


b 


c 


Total 


19.3386 


-3.3148 


10.4459 


196.8437 


-8.8533 


0.1102 


Male 
Female 


Ditto 
Ditto 


- r 4040 
- ,7549 


Ditto 
Ditto 


245.0408 
191.6012 


-10.1826 
-9.3411 


0.1145 
0.1266 


White 
Minority 


Ditto 
Ditto 


-3.7840 
3.0637 


Ditto 
Ditto 


227.3162 
97.6056 


-10.1668 
-5.7496 


0.1240 
0.1053 


Humanities 
Social Science 
Biological Science 
Physical Science 


Ditto 
Ditto 
Dlf'o 
Ditto 


-2.0458 
-1.4638 
-3.7356 
-9.2952 


Ditto 
Ditto 
Ditto 
17.2249 


208.8877 
184.4776 
274.2450 
865.8115 


-10.0801 
-9.1111 
-12.4992 
-33.7576 


0.1339 
0.1256 
0.1530 
0.3361 


High-Scoring 

Middle-Scoring 

Low-Scoring 


Ditto 
Ditto 
Ditto 


-8.5327 
-3.5826 
-2.3596 


10.4459 
Ditto 
Ditto 


554.6409 
336.4294 
154.4809 


-21.6519 
-15.7289 
-9.4230 


0.2182 
0.1949 
0.1641 
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Table 8 



Equations for Empirical Variance Errors of Equipercentile Equating, 
Adjusted for the Means and Standard Deviations of the Distributions for the 

Subpopulations of Interest 



GRE Quantitative 



SE 2 ,* - a + bx q + cx2 



Subpopulation a b c 

Total 665.5842 -33.1299 0.4218 

Male 311.7291 -37.3646 0.4381 

Female 669.1853 -35.6000 0.4843 

White 772.1808 -37.9148 0.4746 

Minority 364.1782 -23.7216 0.4029 

Humanities 730.4559 -38.2972 0.5125 

Social Science 646.3721 -34.8426 0.4807 

Biological Science 948.2828 -46.7520 0.5855 

Physical Science 1870.4530 -76.0760 0.7798 

High-Scoring 1879.3237 -78.9107 0.8350 

Middle-Scoring 1184.2311 -59.0518 0.7456 

Low-Scoring 585.9281 -37.8670 0.6279 
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Table 9 



Equations for Empirical Variance Errors of Linear Equating, 
Adjusted for the First Four Moments of the Distributions for the 

Subpopulations of Interest 



GRE Vetbal-Plus-Quantitative 





SE 2 ,* « 


A + Bz(x) + 

c 


C 2 2(x) c 


SE2* - 


a ♦ bx + 

c 


cx2 

c 


Subpopulation 


A 


B 


C 


a 


b 


c 


Total 


17.0949 


-5.6400 


9.0657 


228.5855 


-4.6933 


0.0259 


Male 
Female 


Ditto 
Ditto 


-7.4302 
-4.3117 


Ditto 
Ditto 


268.7716 
218.0270 


-5.2206 
-4.7124 


0.0269 
0.0276 


White 
Minority 


Ditto 
Ditto 


-6.4249 
3.5416 


Ditto 
Ditto 


278.0194 
96.1123 


-5.6631 
-2.6675 


0.0306 
0.0224 


Humanities 
Social Science 
Bioglcal Science 
Physical Science 


Ditto 
Ditto 
Ditto 
Ditto 


-6.1651 
-3.7860 
-5.8052 
-10.5085 


Ditto 
Ditto 
Ditto 
11.8987 


244.5884 
200.8596 
298.1843 
543.3550 


-4.9732 
-4.3395 
-6.2411 
-10.1344 


0.0270 
0.0262 
0.0345 
0.0486 


High-Sec* i 

Middle-Scoring 

Low-Scoring 


Ditto 
Ditto 
Ditto 


-12.5817 
- 6.0872 
2.0635 


9.0657 
Ditto 
Ditto 


742.9482 
452.9112 
178.3997 


-13.3119 
-9.6751 
-5.0231 


0.0607 
0.0536 
0.0391 
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Table 10 



Equations for Empirical Variance Errors of Equipercentile Equating, 
Adjusted for the Means and Standard Deviations of Lhe Distributions for the 

Subpopulations of Interest 



GRE Verbal-Plus-Quantitative 





SE 2 ,* 


- a + bx c + 


cx 2 . 


Subpopulation 


a 


b 


c 


Total 


718.0959 


-15.3412 


0.0829 


Male 
Female 


820.8492 
703.0153 


-16.7211 
-15.6471 


0.0860 
0.0881 


White 
Minority 


866.6386 
378.4867 


-18.3243 
-10.3019 


0.0978 
0.0716 


Humanities 
Social Science 
Biological Science 
Physical Science 


762.3566 
653.4806 
942.4082 
1287.3908 


-16.1513 
-14.6901 
-20.3080 
-24.6022 


0.0865 
0.0836 
0.1104 
0.1183 


High-Scoring 

Middle-Scoring 

Low-Scoring 


2202.2282 
1438.8581 
663.1466 


-41.2518 
-31.3022 
-18.0895 


0.1939 
0.1712 
0.1249 
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Results 

Tables 11, 12, 13, and 14 give the results for Question 1: To what extent 
do the conversions based on the subpopulation samples agree with the total- 
population conversion if we take as the standard errors of equating those that 
were obvserved in the variation of the 100 conversions based on samples drawn 
at random from the total? With one notable exception, relating to the samples 
drawn from the Physical Science subpopulation, the results given in Table 11, 
based on linear equating for GRE quantitative scores, show that the samples 
drawn from their respective subpopulations yield conversions that are very much 
in line with the conversion based on the total population. There are occasional 
departures from expectation, X these are few in number and they tend to 
occur in the regions of the scale far removed from the vicinity of the mean 
for the particular subpopulation in question. For example, Sample 1 of the 
High-Scoring group shows significant departures (at the 5% level) at the lower 
end of the scale; one of the Middle-Scoring and one of the Low-Scoring samples 
show significant departures at the uppjr end of the scale. One clear explana- 
tion for these departures is that when the mean of the particular group 
differs from that of the total population, the standard errors used in the 
Question 1 phase of the study, taken from equations (l)-(4), are inappropriately 
small for scores at great distances from the mean of the group. (Corresponding, 
they are too large for scores close to the mean of the group.) As implied 
earlier, conversion lines based on samples drawn at random from a population 
tend to intersect in the region of the mean of the population and fan out at the 
ends of the scale, with greater fanning at greater distances from the mean. 

It is noteworthy in this connection that the conversion for the Minority 
sample showed no significant values at any of the five points. Because of 
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Table 11 

Differences between Sample Conversions and Total-Population Conversions, 

Standard Errors of Equating, and 
Standardized Differences (t-values) at Selected Saw Scores on Form D3 

GRE Quantitative Scores; Linear Equating 

Standard Errors Unadjusted 



Form D3 Raw Scores 



Subpopulation Sample 



10 



20 



30 



40 



50 



Male 

Sample 1 
Difference 
SE Equating (Unadj) 
t-value 

Sample 2 
Difference 
SE Equating (Unadj) 
t-value 

Female 
Sample 1 
Difference 
SE Equating (Unadj) 
t-value 

Sample 2 
Difference 
SE Equating (Unadj) 
c-value 

White 
Sample 1 
Difference 
SE' Equating (Unadj) 
t-value 

Minority 
Sample 1 
Difference 
SE Equating (Unadj) 
t-value 



1.15 
10.92 
0. 11 



-2.31 
10.92 
-0.21 



6.78 
10 92 
0.62 



-15.54 
10.92 
-1.42 



12.20 
10 92 
1.12 



-9.81 
10.92 
-0.90 



-0.45 
7.99 
-0.06 



-3.20 
7.99 
-0.40 



4.99 
7.99 
0.62 



-13.45 
7.99 
-1.68 



6.00 
7.99 
0.75 



-6.84 
7.99 
-0.86 



-2.06 
5 52 
-0.37 



-4.09 
5.52 
-0.74 



3.20 
5.52 
0.58 



-11.36 
5.52 
-2.06* 



-0.19 
5.52 
-0.03 



-3.87 
5.52 
-0. 70 



-3.66 
4.37 
-0.84 



-4.99 
4.37 
-1.14 



1.41 
4.37 
0.32 



-9.27 
4.37 
-2. 12* 



-6.38 
4.37 
-1.46 



-0.90 
4.37 
-0.21 



-5.26 
5.45 
-0.96 



-5.88 
5.45 
-1.08 



-0.38 
5.45 
-0.07 



-7.18 
5.45 
-1.32 



-12.58 
5 45 
-2.31* 



2.07 
5 45 
0.38 



ERIC 
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Table 11 (continued) 

Differences between Sample Conversions and Total-Population Conversions, 

Standard Errors of Equating, and 
Standardized Differences (t-values) at Selected Raw Scores on Form D3 

GRE Quantitative Scores; Linear Equating 

Standard Errors Unadjusted 



Form D3 Raw Scores 



Subpopulation Sample 

Humanities 
Sample 1 
Difference 
SE Equating (Unadj) 
t-value 

Sample 2 

Difference 

SE Equating (Unadj) 

t-value 

Social Science 
Sample 1 
Difference 
SE Equating (Unadj) 
t-value 

Sample 2 
Difference 
SE Equating (Unadj) 
t-value 

Biological Science 
Sample 1 
Difference 
SE Equating (Unadj) 
t-value 

Sample 2 
Difference 
SE Equating (Unadj) 
t-value 



10 



20 



30 



40 



50 



-11.56 
10.92 
-1.06 



-14.77 
10.92 
-1.35 



9.06 
10.92 
0.83 



0.43 
10.92 
0.04 



10.31 
10.92 
0.94 



-17.04 
10.92 
-1.56 



-8.23 
7.99 
-1.03 



-11.08 
7.99 
-1.39 



5.58 
7.99 
0.70 



0.58 
7.99 
0.07 



6.27 
7.99 
0.78 



-12.27 
7.99 
-1.54 



-4.90 
5.52 
-0.89 



-7.38 
5.52 
-1.34 



2.09 
5.52 
0.38 



0.74 
5.52 
0. 13 



2.23 
5.52 
0.40 



-7.50 
5.52 
-1.36 



-1.57 
4.37 
-0.36 



-3.69 
4.37 
-0.34 



-1.39 
4.37 
-0.32 



0.89 
4.37 
0.20 



-1.81 
4.37 
-0.41 



-2.73 
4.37 
-0.63 



1.75 
5.45 
0.32 



0.00 
5.45 
0.00 



-4.87 
5.45 
-0.89 



1.04 
5.45 
0.19 



-5.85 
5.45 
■1.07 



2.04 
5.45 
0.37 
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Table 11 (continued) 

Differences between Sample Conversions and Total-Population Conversions, 

Standard Errors of Equating, and 
Standardized Differences (t-values) at Selected Raw Scores on Form D3 

GRE Quantitative Scores; Linear Equating 

Standard Errors Unadjusted 



Form D3 Raw Scores 



Subpopulation Sample 

Physical Science 
Sample 1 
Difference 
SE Equating (Unadj) 
t-value 

Sample 2 
Difference 
SE Equating (Unadj) 
t-value 

Sample 3 
Difference 
SE Equating (Unadj) 
t-value 

Sample 4 
Difference 
SE Equating (Unadj) 
t-value 

High-Scoring 
Sample 1 
Difference 
SE Equating (Unadj) 
t-value 

Sample 2 
Difference 
SE Equating (Unadj) 
t-value 



10 



20 



30 



40 



-7.85 
10.92 
-0.72 



-60.39 
10.92 
-5 . 53** 



-41 . 10 
10.92 
-3.76** 



-25 . 50 
10.92 
-2.33* 



23.56 
10.92 
2.16* 



7.39 
10.92 
0.68 



-6.87 
7.99 
-0.86 



-46.58 
7.99 
-5 . 83** 



-31.67 
7.99 
3.96** 



-20.15 
7.99 
-2.52* 



17.94 



4. 
7. 
0. 



99 
25* 



39 
99 

55 



-5.89 
5.52 
-1.07 



-32.77 
5.52 
-5 . 94** 



-22.24 
5.52 
-4.03** 



-14.80 
5.52 
-2.68** 



12.33 
5.52 
2.23* 



1.40 
5.52 
0.25 



-4.92 
4.37 
-1.13 



6.71 
4.37 
1.54 



-1.60 
4.37 
-0.37 



50 



-3.94 
5.45 
-0.72 



-18.96 -5.15 
4.37 5.45 
-4.34** -0.94 



-12.82 -3.39 
4.37 5.45 
-2.93** -0.62 



-9.46 -4.11 
4.37 5.45 
-2.16* -0.75 



1.09 
5.45 
0.20 



-4.60 
5.45 
-0.84 
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Table 11 (continued) 

Differences between Sample Conversions and Total-Population Conversions, 

Standard Errors of Equating, and 
Standardized Differences (r-values) at Selected Raw Scores on Form D3 

GRE Quantitative Scores; Linear Equating 

Standard Errors Unadjusted 

Form D3 Raw Scores 



Subpopulation Sample 



Middle-Scoring 
Sample 1 
Difference 
SE Equating (Unadj) 
t-value 

Sample 2 
Difi'<sre :ce 
SE Equating (Unadj) 
t-value 

Low-Scoring 
Sample 1 
Difference 
SE Equat : ng (Unadj) 
t-value 

Sample 2 
Difference 
SE Equating (Unadj) 
t-value 



*p < 0.05 
**p < 0.01 

Note: Negative values in this table indicate that the sample conversion 
is lower than the total -population conversion. 



10 



20 



30 



40 



50 



-7.57 
10.92 
-0.69 



-4.76 
7.99 
-0.60 



-1.9/ 
5 52 
-0.35 



0.87 
4.37 
0.20 



3.68 
5.45 
0.67 



1.28 

30-92 
0.12 



-1.99 
7.99 
-0.25 



-5.26 
5.52 
-0.95 



-8.52 
4.37 
-1.95 



-11. #9 
5.45 
-2.16* 



-7.81 
10.92 
-0.72 



-3.77 
7.99 
-0.47 



0.27 
5.52 
0.05 



4.31 
4.37 
0.99 



8 

5.45 
1.53 



-4.41 
10.92 
-0.40 



0.32 
7.99 
0.04 



5.05 
5.52 
0.91 



9.78 
4.37 
2.24* 



14.50 
5.45 
2.66** 



EMC 
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Table 12 

Differences between Sample Conversions and Total-Population Conversions, 

Standard Errors of Equating, and 
Standardized Differences (t-values) at Selected Raw Scores on Form D3 

GRE Quantitative Scores; Equipercentile Equating 

Standard Errors Unadjusted 



Form D3 Raw Scores 



Sub population Sample 

Male 

Sample 1 
Difference 
SE Equating (Unadj) 
t-value 

Sample 2 
Difference 
SE Equating (Unadj) 
t-value 

Fenale 
Sample 1 
Difference 
SE Equating (Unadj) 
t-value 

Sample 2 
Difference 
SE Equating (Unadj) 
t-value 

White 
Sample 1 
Difference 
SE Equating (Unadj) 
t-value 

Minority 
Sample 1 
Difference 
SE Equating (Unadj) 
t-value 



10 



20 



30 



40 



50 



10.45 
19.40 
0.54 



40.49 
19.40 
2.09* 



14.81 
19.40 
0. 76 



-25.85 
19.40 
-1.33 



4.41 
19.40 
0.23 



-23.62 
13.10 
-1.80 



-1.94 
13.10 
-0.15 



-2.47 
13.10 
-0.19 



-14.17 
13.10 
-1.08 



0.44 
13.10 
0.03 



-3.13 
13.10 
-0.24 



5.02 
7.16 
0.70 



-6.65 
7.16 
-0.93 



5.20 
7.16 
0. 73 



-8.16 
7.16 
-1.14 



0.91 
7.16 
0.13 



-1.59 
7.16 
-0.22 



-2.65 
3.90 
-0.68 



-4.58 
3.90 
-1.17 



-1.49 
3.90 
-0.38 



-9.38 
3.90 
-2.40* 



-6.94 
3.90 
-1.78 



-9.82 
3.90 
-2.52* 



-7.53 
7.97 
-0.94 



-2.26 
7.97 
-0.28 



3.78 
7.97 
0.47 



-10.78 
7.97 
-1.35 



-12.74 
7.97 
-1.60 



-1.71 
7.97 
-0.22 
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Table 12 (continued) 

Differences between Sample Conversions and Total-Popvlation Conversions, 

Standard Errors of Equating, and 
Standardized Differences (t-values) at Selected Raw Scores on Form D3 

GRE Quantitative Scores; Equipercentile Equating 

Standard Errors Unadjusted 

Form D3 Raw Scores 



Subpopulation Sample 10 20 30 40 50 



Humanities 
Sample 1 

Difference a -0.19 -2.53 -4.19 9.24 

SE Equating (Unadj) - 13.10 7.16 3.90 7.97 

t-value - -0.01 -0.35 -1.07 1.16 

Sample 2 

Difference a -5.56 -4.65 -4.05 -0.32 

SE Equating (Unadj) - 13.10 7.16 3.90 7.97 

t-value - -0.42 -0.65 -1.04 -0.04 

Social Science 
Sample 1 

Difference a 7.23 5.52 -5.47 -4.01 

SE Equating (Unadj) - 13.10 7.16 3.90 7.97 

t-value - 0.55 0.77 -1.40 -0.50 

Sample 2 

Difference 11.73 -5.77 -1.29 3.9. -4.11 

SE Equating (Unadj) 19.40 13.10 7.16 3.90 7.97 

t-value 0.60 -0.44 -0.18 1.00 -0.52 

Biological Science 
Sample 1 

Difference a 22.92 -7.02 -1.04 -4.80 

SE Equating (Unadj) - 13.10 7.16 3.90 7.97 

t-value - 1.75 -0.98 -0.27 -0.60 

Sample 2 

Difference a -25.12 -5.84 -3.60 0.46 

SE Equating (Unadj) - 13.10 7.16 3.90 7.97 

t-value - -1.92 -0.82 -0.92 0.06 
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Table 12 (continued) 

Differences between Sample Conversions and Total-Population Conversions, 

Standard Erroxs ~>f Equating, and 
Standardized Differences (t-values) at Selected Saw Scores on Form D3 

GRE Quantitative Scores; Equipercentile Equating 

Standard Errors Unadjusted 



Form D3 Raw Scores 



Sub population Sample 

Physical Science 
Sample 1 
Difference 
SE Equating (Unadj) 
t-value 

Sample 2 
Difference 
SE Equating (Unadj) 
t-value 

Sample 3 
Difference 
SE Equating (Unadj) 
t-value 

Sample 4 
Difference 
SE Equating (Unadj) 
t-value 

High-Scoring 
Sample 1 
Difference 
SE Equating (Unadi) 
t-value 

Sample 2 
Difference 
SE Equating (Unadj) 
t-value 



JO 



20 



30 



40 



50 



20.83 
13.10 
1.59 



-45.06 
13.10 
-3.44** 



-28.86 
13.10 
-2.20* 



-51.50 
13.10 
-3.93** 



-2.17 
7.16 
-0.30 



-36./ 9 
7.16 
-5.10** 



-26.99 
7.16 
-3.77** 



-24.98 
7.16 
-3.49** 



19.77 
7.16 
2.76** 



S.24 
7.16 
0.73 



-10.18 
3.90 
-2.61** 



-21.86 
3.90 
-5.61** 



-15.43 
3.90 
-3.96** 



-7.51 
3.90 
-1.93 



8. 
3. 
2. 



17 
90 
10* 



25 
90 



0.06 



-2.25 
7.97 
-0.28 



-2.48 
7.97 
-0.31 



-1.38 
7.97 
-0.17 



-2.72 
7.97 
-0.34 



2.79 
7.97 
0.35 



-7.33 
7.97 
-0.92 
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Table 12 (continued) 

Differences between Sample Conversions and Total-Population Conversions, 

Standard Errors of Equating, and 
Standardized Differences (t-values) at Selected Raw Scores on Form D3 

GRE Quantitative Scores; Equipercentile Equating 

Standard Errors Unadjusted 



Form D3 Raw Scores 



Subpopulation Sample 10 20 30 40 50 



Middle-Scoring 
Sample 1 

Difference a -28,72 -9.94 2.94 3.38 

SE Equating (Unadj) - 13.10 7.16 3.90 7.97 

t-value - -2.19* -1.39 0.75 0.42 

Sample 2 

Difference a 0.25 -12.17 -8.04 -15.26 

SE Equating (Unadj) - 13.10 7.16 3.90 7.97 

t-value - C.02 -1.70 -2.06* -1.92 

Low-Scoring 
Sample 1 

Difference 11.10 1.12 -2.84 3.10 8.02 

SE Equating (Unadj) 19-40 13.10 7.16 3.90 7.97 

t-value 0.57 0.09 -0.40 0.79 1.01 

Sample 2 

Difference 16.57 6.87 3.19 8.79 3.09 

SE Equating (Unadj) 19.40 13.10 7.16 3.90 7.97 

t-value 0.85 0.52 0.45 2.26* 0.39 



No data are available to establish conversions at these points. 

*p < 0.05 
**p < 0.01 

Note: Negative values in this table indicate that the sample conversion 
is lower than the total-population conversion. 
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Table 13 

Differences between Sample Conversions and Total-Population Conversions, 

Standard Errors 'of Equating, and 
Standardized Differences (t-values) at Selected Raw Scores on Form D3 

GRE Verbal-Plus-Quai w .itative Scores; Linear Equating 

Standard Errors Unadjusted 



Form D3 Raw Scores 



Subpopulation Sample 

Male 

Sample 1 
Difference 
SE Equating (Unadj) 
t-value 

Sample 2 
Difference 
SE Equating (Unadj) 
t-value 



25 



1.75 
11.29 
0.15 



-10.08 
11.29 
-0.89 



50 



-2.65 
7.66 
-0.35 



-9.29 
7.6b 
-1.21 



75 



-7.06 
4.74 
-1.49 



-8.50 
4.74 
-1.79 



100 



-11.46 
4.31 
-2.66** 



-7.71 
4.31 
-1.79 



125 



-15.86 
6.86 
-2.31* 



-6.92 
6.86 
-1.01 



o ample 1 

Difference 1.98 3.91 5.84 7.77 9.70 

SE Equating (Unadj) 11.29 7.66 4.74 4.31 6.86 

t-value 0.18 0.51 1.23 1.80 1.41 

Sample 2 

Difference -15.78 -11.42 -7.06 -2.70 1.65 

SE Equating (Unadj) 11.29 7.66 4.74 4.31 6.86 

t-value -1.40 -1.49 -1.49 -0.63 0.24 

White 
Sample 1 

Difference 3.96 -1.42 -6.80 -12.18 -17.56 

SE Equating (Unadj) 11.29 7.66 4.74 4.31 6.86 

t-value 0.35 -0.18 -1.43 -2.83** -2.56* 

Minority 
Sample 1 

Difference 3.80 -1.71 -7.23 -12.74 -18.26 

SE Equating (Unadj) 11.29 7.66 4.74 4.31 6.86 

t-value 0.34 -0.22 -1.53 -2.96** -2.66** 
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Table 13 (continued) 

Differences between Sample Conversions and Total-Population Conversions, 

Standard Errors of Equating, and 
Standardised Differences (t-values) at Selected Raw Scores on Form D3 

GRE Verbal-Plus-Quantitative Scores; Linear Equating 

Standard Errors Unadjusted 



Form D3 Saw Scores 



Subpopulation Sample 

Humanities 
Sample 1 
Difference 
SE Equating (Unadj) 
t-value 

Sample 2 
Difference 
SE Equating (Unadj) 
t-value 

Social Science 
Sample 1 
Difference 
SE Equating (Unadj) 
t-value 

Sample 2 
Difference 
SE Equating (Unadj) 
t-value 

Biological Science 
Sample 1 
Difference 
SE Equating (Unadj) 
t-value 

Sample 2 
Difference 
SE Equating (Unadj) 
t-value 



25 



50 



75 



100 



125 



3.23 
11.29 
0.29 



1.28 
11.29 
0.11 



7.16 
11.29 
0.63 



-3.98 
11.29 
-0.35 



-0.39 
11.29 
-0.03 



-24.09 
11.29 
-2.13* 



3.39 
7.66 
0.44 



1.70 
7.66 
0.22 



4.76 
7.66 
0.62 



-1.21 
7.66 
-0.16 



-2.05 
7.66 
-0.27 



-14.98 
7.66 
-1.95 



3.54 
4.74 
0.75 



2.11 
4.74 
0.45 



2.36 
4.74 
0.50 



1.55 
4.74 
0.33 



-3.72 
4.74 
-0.78 



-5.87 
4.74 
-1.24 



3.70 
4.31 
0.86 



2.52 
4.31 
0.59 



-0.04 
4.31 
-0.01 



4.31 
4.31 
1.00 



-5.38 
4.31 
-1.25 



3.24 
4.31 
0.75 



3.85 
6.86 
0.56 



2.93 
6.86 
0.43 



-2.44 
6.86 
-0.36 



7.07 
6.86 
1.03 



-7.05 
6.86 
-1.03 



12.35 
6.86 
1.80 
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Table 13 (continued) 

Differences between Sample Conversions and Total-Population Conversions, 

Standard Errors of Equating, and 
Standardized Differences (t-values) at Selected Raw Scores on Form D3 

GRE Verbal-Plus-Quantitative Scores; Linear Equating 

Standard Errors Unadjusted 



Form D3 Raw Scores 



Sub population Sample 

Physical Science 
Sample 1 
Difference 
SE Equating (Unadj) 
t-value 

Sample 2 
Difference 
SE Equating (Unadj) 
t-value 

Sample 3 
Difference 
SE Equating (Unadj) 
t-value 

Sample 4 
Difference 
SE Equating (Unadj) 
t-value 

High-Scoring 
Sample 1 
Difference 
SE Equating (Unadj) 
t-value 

Sample 2 
Difference 
SE Equating (Unadj) 
t-value 



25 



50 



75 



100 



125 



-24.40 
11.29 
-2.16* 



-59.22 
11.29 
-5.25** 



-40.63 
11.29 
-3.60** 



-25.48 
11.29 
-2.26* 



6.99 
11.29 
0.62 



3.99 
11.29 
0.35 



-20.08 
7.66 
-2.62** 



-44.06 
7.66 
-5.75** 



-29.70 
7.66 
-3.88** 



-20.90 
7.66 
-2. 73** 



5.12 
7.66 
0.67 



1.14 
7.66 
0.15 



-15.76 
4.74 
-3.33** 



-28.91 
4.74 
-6.10** 



-18.77 
4.74 
-3.96** 



-16.33 
4.74 
-3.45** 



3.26 
4.74 
0.69 



-1.70 
4.74 
-0.36 



-11.44 
4.31 
-2.65** 



-13.75 
4.31 
-3.19** 



-7.84 
4.31 
-1.82 



-11.75 
4.31 
-2.73** 



1.39 
4.31 
0.32 



-4.55 
4.31 
-1.06 



-7.12 
6.86 
-1.04 



1.40 
6.86 
0.20 



3.10 
6.86 
0.45 



-7.18 
6.86 
-1.05 



-0.48 
6.86 
-0.07 



-7.39 
6.86 
-1.08 
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Table 13 (contir.ufe-i) 

Differences between Sample Conversions and Total-Population Conversions, 

Standard Errors of Equating, and 
Standardized Differences (t-values) at Selected Raw Scores ou Form D3 

GRE Verbal-Plus-Quantitative Scores; Linear Equating 

Standard Errors Unadjusted 



Form D3 Raw Scores 



Subpopulation Sample 

Middle-Scoring 
Sample 1 
Difference 
SE Equating (Unadj) 
t-value 

Sample 2 
Difference 
SE Equating (Unadj) 
t-value 

Low-Scoring 
Sample 1 
Difference 
SE Equating (Unadj) 
t-value 

Sample 2 
Difference 
SE Equating (Unadj) 
t-value 



25 



50 



75 



100 



125 



-21.95 
11.29 
-1.94 



-35.08 
11.29 
-3.11** 



7.26 
11.29 
0.64 



4.92 
11.29 
0.44 



-12.53 
7.66 
-1.63 



-23.73 
7.66 
-3.10** 



4.50 
7.66 
0.59 



7.50 
7.66 
0.98 



-3.11 
4.74 
-0.66 



-12.38 
4.74 
-2.61** 



1.74 
4.74 
0.37 



10.08 
4.74 
2.13* 



6.32 
4.31 
1.47 



-1.03 
4.31 
-0.24 



-1.02 
4.31 
-0.24 



12.67 
4.31 
2.94** 



15.74 
6.86 
2.29* 



10.32 
6.86 
1.50 



-3.78 
6.86 
-0.55 



15.25 
6.86 
2.22* 



*p < 0.05 
**p < 0.01 

Note: Negative values in this table indicate that the sample conversion is 
lower than the total-population conversion. 
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Table 14 

Differences between Sample Conversions and Total-Population Conversions, 

Standard Errors of Equating, and 
Standardized Differences (t-values) at Selected Raw Scores on Form D3 

GRE Verbal-Plus-Quantitative Scores, Equipercentile Equating 

Standard Errors Unadjusted 



Form D3 Raw Scores 



Sibpopulation Sample 

Male 

Sample 1 
Difference 
SE Equating (Unadj) 
t-value 

Sample 2 
Difference 
SE Equating (Unadj) 
t-value 

Female 
Sample 1 
Different 

SE Equating (Unadj) 
t-value 

Sample 2 
Difference 
SE Equating (Unadj) 
t-value 

White 
Sample 1 
Difference 
SE Equating (Unadj) 
t-value 



25 



a 



23.20 
19.66 
1.18 



50 



-16.21 
12.58 
-1.29 



7.33 
12.58 
0.58 



-0.39 
12.58 
-0.03 



-12.09 
12.58 
-0.96 



4.62 
12.58 
0.37 



75 



100 



-3.02 
5.80 
-0.52 



-8.91 
5.80 
-1.54 



10.00 
5 80 
1.72 



-4.35 
5.80 
-0.75 



-8.69 
5.80 
-1.50 



-14.49 
3.56 
-4.07** 



-6.81 
3.56 
-1.91 



8.62 
3.56 
2.42* 



0.07 
3.56 
0.02 



-13.25 
3.56 
-3.72** 



125 



-17.33 
9.76 
-1.77 



5. 
9. 
0. 



58 
76 
57 



-3.97 
9.76 
-0.41 



-2.80 
9.76 
-0.29 



-8.00 
9.76 
-0.82 



Minority 
Sample 1 

Difference 1.79 -1.04 -1.65 -18.40 -38.50 

SE Equating (Unadj) 19.66 12.58 5.80 3.56 9.76 

t-value 0.09 -0.08 -0.28 -5.17** -3.94** 
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Table 14 (continued) 

Differences between Sample Conversions and Total-Population Conversions, 

Standard Errors of Equating, and 
Standardized Differences (t-values) at Selected Raw Scores on Form D3 

GRE Verbal-Plus-Quantitative Scores; Equipercentile Equating 

Standard Errors Unadjusted 

Form D3 Raw Scores 



Subpopulation Sample 25 50 75 100 125 



Humanities 
Sample 1 

Difference a 2.26 3.26 3.45 -13.33 

SE Equating (Unadj) - 12.58 5.80 3.56 9.76 

t-value - 0.18 0.56 0.97 -1.37 

Sample 2 

Difference a 0.54 2.28 2.00 0.76 

SE Equating (Unadj) - 12.58 5.80 3.56 9-76 

t-value - 0.04 0.39 0.56 0.08 

Social Science 
Sample 1 

Difference 15 . 31 15 . 06 0 . 74 1.27 22 . 50 

SE Equating (Unadj) 19.66 12.58 5.80 3.56 9 76 

t-value 0.78 1.20 0.13 0.36 2.30* 

Sample 2 

Difference 16.35 4.50 2.91 5.84 1.93 

SE Equating (Unadj) 19.66 12.58 5.80 3.56 9.76 

t-value 0 . 83 0 . 36 0.50 1 . 64 0 . 20 

Biological Science 
Sample 1 

Difference a 3.27 -4.15 -5.23 -23.28 

SE Equating (Unadj) - 12.58 5.80 3.56 9.76 

t-value - 0.26 -0.72 -1.47 -2.38* 

Sample 2 

Difference a -26.55 -9.84 6.49 2.90 

SE Equating (Unadj) - 12.58 5.80 3.56 9.76 

t-value - -2.11* -1.70 1.82 0.30 
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Table 14 (continued) 

Differences between Sample Conversions and Total-Population Conversions, 

Standard Errors of Equating, and 
Standardized Differences (t-values) at Selected Raw Scores on Form D3 

GRE Verbal-Plus-Qi-uititative Scores; Equipercentile Equating 

Standard Errors Unadjusted 



Form D3 Raw Scores 



Subpopulation Sample 

Physical Science 
Sample 1 
Difference 
SE Equating (Unadj) 
t-value 

Sample 2 
Difference 
SE Equating ( Unadj) 
t-value 

Sample 3 
Difference 
SE Equating (Unadj) 
t-value 

Sample 4 
Difference 
SE Equating (Unadj) 
t-value 

High-Scoring 
Sample 1 
Difference 
SE Equating (Unadj) 
t-value 

Sample 2 
Difference 
SE Equating (Unadj) 
t-value 



25 



50 



75 



100 



125 



-9.11 
12.58 
-0.72 



-23.24 
12.58 
-1.85 



■18.09 
12.58 
-1.44 



■15.78 
12.58 
-1.25 



-20.55 
5.80 
-3.54** 



-31.73 
5.80 
-5.47** 



-24.81 
5.80 
-4.28** 



-23.77 
5.80 
-4.10** 



8.78 
5.80 
1.51 



39 
80 
62 



-11.33 
3.56 
-3 . 18** 



-15.52 
3.56 
-4.36** 



-9.25 
3.56 
-2.60** 



-12.03 
3 56 
-3.38** 



1.77 
3.56 
0.50 



-6.04 
3.56 
-1.70 



2.80 
9.76 
0.29 



5.43 
9.76 
0.56 



1.22 
9.76 
0.12 



0.03 
9.76 
0.00 



0.22 
9.76 
0.02 



-1.85 
9.76 
-0.19 
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Table 14 (continued) 

Differences between Sample Conversions and Total-Population Conversions, 

Standard Errors of Equating, and 
Standardized Differences (t-values) at Selected Raw Scores on Form Dj 

GRE Verbal-Plus-Quantitative Scores; Equipercentile Equating 

Standard Errors Unadjusted 



Form D3 Raw Scores 



Subpopulation Sample 

Middle-Scoring 
Sample 1 
Difference 
SE Equating (Unadj) 
t -value 

Sample 2 
Difference 
SE Equating (Unadj) 
t-value 

Low-Scoring 
Sample 1 
Difference 
SE Equating (Unadj) 
t-value 

Sample 2 
Difference 
SE Equating (Unadj) 
t-valuc 



25 



16.43 
19.66 
0.84 



22.17 
19.66 
1.13 



50 



75 



100 



-55.91 
12.58 
-4.44** 



-24.76 
12.58 
-1.97* 



5.80 
12.58 
0.46 



4.89 
12,58 
0.39 



-1.64 
5.80 
-0.28 



-11.75 
5.80 
-2.03* 



4.82 
5.80 
0.83 



8.83 
5.80 
1.52 



7.29 
3.56 
2.05* 



-2.89 
3.56 
-0.81 



-8.08 
3.56 
-2.27* 



21.90 
3,56 
6.15** 



125 



-14.66 
9.76 
-1.50 



No data are available to establish conversions at these points. 

*p < 0.05 
**p < 0.01 

Note: Negative values in this table indicate that the sample conversion 
is lower than the total-population conversion. 
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their low means on these two forms, one would ordinarily have expected 
significant departures for their conversion line at the upper end of the 
scale. These departures did not occur in Table 11; they did, however, occur 
in Table 12, which io based on the results of equipercentile equating. 

Some mention should also be made here of the fact that, with the 
exception of the White and Minority groups, more than one sample was drawn 
from each of the subpopulations, even though some of the subpopulations were 
relatively small. Clearly, with small populations, there is likely to be far 
too much data common to the samples to yield experimentally independent 
observations. But having drawn the samples and calculated the results, the 
authors judged that the correct course of action would be to report all the 
data. As it stands, then, it should be kept in mind ttiat the two or more 
samples drawn from the smaller subpoulations yield somewhat redundant 
information. 

This discussion brings us to the data for the Physical Science group. 
Initially, only two pairs of samples were drawn from this subpopulation, as 
for all but two of the other subpopulations, and correspondingly, only two 
linear conversions were carried out for this group. When the results of these 
operations were reviewed (see the results for Samples 1 and 2), they were 
thought to be too different to accept without question, especially because of 
the large proportion of data likely to be common to them. Accordingly, the 
sampling process was reviewed, and when that was found to be free of error, 
two more samples, Samples 3 and 4, were also drawn to determine the direction 
in which the data were leaning. The results in Table 11 seem to indicate 
that, in spite of its consistency with the findings for all the other samples 
drawn from the 10 other subpopulations, the conversion for Sample 1 was as 
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divergent from the Physical Science subpopulstion conversion in **ie direction 
as was Sample 2 in the other direct ion. The r nversions for Samples 3 and 4 
were more intermediate and closer to their subpopulation conversion* 

In general, these results show that the conversions based on the samples 
drawn from the Physical Science subpopulation are clearly at odds with the 
Total-Population conversion, more so, as expected from the high level of its 
mean, at the lower end of the scale* Indeed, the conversion lines for all four 
samples fall below the overall conversion line, as does the line based on the 
entire Phyrical Science subpopu\ation. Ooviously, one explanation for the be- 
havior of the Physical Science group is the substantial difference between its 
mean and the population mean* However, other groups, whose means are similarly 
different from the population mean, dc not shows a degree of departure of tills 
magnitude* The Minority group, whose means are even more distant from the 
overall population means than the Physical Science group, shows no significant 
t-values at all; and the results for the High-Scoring group, whose means are 
also iTaCh higher than those of the total population, do not show significance 
at any point beyond the 5 percent level This is all the more perplexing 
because of the degree of overlap between the Physical Science and the High- 
Scoring groups* There were 997 cases common to these two groups in the Dl 
population, representing 44 percent of the Physical Science group and 27 percent 
o f the High-Scori g group; and there were 982 cases common to the two groups lu 
the 03 population, representing 46 percent of th« Physiol Science g*oup and 27 
percent of the High-Scoring group* With this degree of overlap, one might, 
perhaps, have expected greater similarity in their conversion equations* 

Examination of Table 12, which gives corresponding results for equi per- 
centile equating, reveals that it is essentially consistent wich Table 11 in 

55 
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all respects. As I.: Table 11, the only subpopulatlon that shows extensive 
~apartures fro© the population conversion is the Physical Science group. It 
Is noted parenthetically that, In Table 12 as in other tables that report the 
results of equlpercentlle conversions , there are some values that cannot be 
determined , simply because data are not available at those points. Unlike 
xlnear equating that, by definition, is based on the first two moments and 
yields a line capable if extrapolation without limit, equlpercentlle equating 
is largely a local process, depending on the availability of data at the score 
points of interest. If there are no data, there cannot be a conversion except, 
of course, by arbitrary extrapolation without the benefit of actual data. 

Table 13, based on linear equating data taken from the heterogeneous 
verbal-plus-quantitative test, yields information generally consistent with its 
counterpart, Table 11, which is also baeeC on linear equating but takes its 
data from the homogeneous quantitative test. Indeed, Table 13 shows slightly 
larger absolute t-values, on average, and a somewhat greater number of statisti- 
cally significant values than are found in Tabl? 11. As expected, the same 
results, generally, are found for equlpercentlle equating, as be seen in 

comparing Table 14 with Table 12. No satisfactory method or evaluating the 
statistical significance of the differences between Tables 11 and 12 on the one 
hand and Tables 13 and 14 o^ the other is available. The interpretation is 
further complicated by the fact, as observed earlier in reviewing Tables 1 and 2, 
that the forms or the heterogeneous teets are not strictly parallel, an 
observation supported by an examination of the distribution of item content over 
the 21 or 22 reading comprehension items in the verbal component of the GRE 
verbal-plus-quantitative test. It is customary in the construction of items 
of this type to represent each of the four rvrricular areas (humanities, 
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social science , biological science, and physical science) with the use of 
either a long passage followed by seven or eight items, or a short passage fol- 
lowed by three or four item*, and to alio;? the association of passage length 
(and number of items) with curricular area to vary from form to form* In the 
construction of Forms Dl and D3, the lengths of the passages and the numbers of 
items based on them were distributed as shown in the table below. From the 
information given in this table, it appears that Form Dl gave more weight to 
humanities and biological science and less to social science and physical 
science; Form D3 



Humanities 

Passage No* of 
Length Items 



Social Science 



Biological 
Science 



Passage No* of Passage No* of 
Length Items Length Items 



Physical 
Science 

Passage No* of 
Length Items 



Form Dl 
Form D3 



Long 
Short 



8 
3 



Short 
Long 



3 
7 



Long 
Long 



7 
8 



Short 
Short 



3 
4 



gave more weight to social and biological science and less to humanities and 
physical science* It is significant, however, that while physical science 
received relatively little weight in either form, it received more weight in 1)3 
than in Dl. Although it is difficult to speculate on the effects these differ- 
ences may have exerted on the study, it seems clear that the two forms were not 
in fact strictly parallel and that their differences may well have interacted 
with the performances of the four curriculum grraps in such a way as to affect 
their individual conversions as observed in this study* 

In reviewing the results summarized in Tables 11-14, it must be borne in 



mind that the departures of the conversion lines for the subpopulation samples 
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from the overall conversion line were evaluated statistically by using 
standard errors appropriate to a population with its own particular mean and 
standard deviation , not the mean and standard deviation of the subpopulation. 
As already pointed out, some of these subpopulations are characterized by 
mean scores at levels quite different from that of the general population 
and by degrees of skewness that are consistent with their means* The standard 
errors appropriate to these subpopulations are different from those appro- 
priate to the general population* Accordingly, Tables 15-18 are presented, 
which correspond respectively to Tables 11-14. They differ from the preceding 
tables in the respect that the empirical standard error formulas used to 
calculate the observed differences in the conversion lines have been adjusted 
with respect to the position of the mean and its standardised distance from 
the score points of interest. In addition, the standard error formulas for 
linear equating, in Tables 15 and 17, have also been adjusted for the third 
moment (skewness) and, whe?.e possible, the fourth moment (kurtosis). 

Figures 1 and 2 illustrate the discrepancies between sample conversions 
and the Total-Population conversion. Both figures have been oriented in such 
a way as to show t overall conversion as a base line and the sample 
conversion in relation to it. Figure 1 shows the discrepancy between the 
Female Sample 1 conversion and the base line conversion in relation to 1.96 
standard errors of equating (the "box") and in relation to 2.58 standard 
errors of equating (the "whisker"), when appropriate adjustments have been 
made in the standard errors for the first four moments of the distributions. 
The close agreement between the Female Sample 1 conversion and the Total- 
Population conversion is evident in Figure 1. Figure 2 gives a corresponding 
picture, but for a case of a large discrepancy, that between the Physical 
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Table 15 

Differences between Sample Conversions and Total-Population Conversions, 

Standard Errors of Equating, and 
Standardized Differences (t-values) at Selected Raw Scores on Form D3 

GRE Quantitative Scores; Linear Equating 

Standard Errors Adjusted for the First Four Moments 



Form D3 Raw Scores 



Sub population Sample 

Male 

Sample 1 
Difference 
SE Equating (Adj) 
t-value 

Sample 2 
Difference 
SE Equating (Adj) 
t-value 

Female 
Sample 1 
Difference 
SE Equating (Adj) 
t-value 

Sample 2 
Difference 
SL Equating (Adj) 
t-value 

White 
Sample 1 
Difference 
SE Equating (Adj) 
t-value 

Minority 
Sample 1 
Difference 
SI Equating (Adj) 
t-value 



10 



20 



30 



40 



50 



1.15 
12.44 
0.09 



-2.31 
12.44 
-0.19 



6.78 
10.53 
0.64 



-15.54 
10.53 
-1.48 



12.20 
11.75 
1.04 



-9.81 
7.12 
-1.38 



-0.45 
9.34 
-0.05 



-3.20 
9.34 
-0.34 



4.99 
7.44 
0.67 



-13.45 
7.44 
-1.81 



6.00 
8.58 
0.70 



-6.84 
4.97 
-1.38 



-2.06 
6.53 
-0.31 



-4.09 
6.53 
-0.63 



3.20 
5- 03 
0.64 



-11.36 
5.03 
-2 . 26* 



-0.19 
5.83 
-0.03 



-3.87 
4.46 
-0.87 



-3.66 
4.5"' 
-0.80 



-4.99 
4.57 
-1.09 



1.41 
4 53 
0.31 



-> . 
4.53 
-2.05* 



-6.38 
4.37 
-1.46 



-0.90 
6.01 
-0.15 



-5.26 
4.71 
-1.12 



-5.88 
4.71 
-1.25 



-0.38 
6.40 
-0.06 



-7.18 
6.40 
•1.12 



-12.58 
5 39 
-2.33* 



2.07 
8.56 
0.24 
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Table 15 (continued) 

Differences between Sample Conversions and Total-Population Conversions, 

Standard Errors of Equating, and 
Standardized Differences (t-values) at Selected Raw Scores on Form D3 

GRE Quantitative Scores; Linear Equating 

Standard Errors Adjusted for the First Four Moments 



Form D3 Raw Scores 



Subpopulation Sample 

Humanities 
Sample 1 
Difference 
SE Equating (Adj) 
t -value 

Sample 2 
Difference 
SE Equating (Adj) 
t-vslue 

Social Science 
Sample 1 
Difference 
SE Equating (Adj) 
t-value 

Sample 2 
Difference 
SE Equating (Adj) 
t-value 

Biological Science 
Sample 1 
Difference 
SE Equating (Adj) 
t-value 

Sample 2 
Difference 
SE Equating (Adj) 
t-value 



10 



20 



30 



40 



50 



-11.56 
11.02 
-1.05 



-14.77 
11.02 
-1.34 



9.06 
10 29 
0.88 



0.43 
10.29 

0.O', 



10.31 
12.83 
0.80 



-17.04 
12.83 
-1.33 



-8.23 
7.80 
-1.05 



-11.08 
7.80 
-1.42 



5.58 
7.25 
0.77 



0.58 
7.25 
0.08 



6.27 
9.25 
0.68 



-12.27 
9.25 
-1.33 



-4.90 
5.20 
-0.94 



-7.38 
5.20 
-1.42 



2.09 
4.92 
0.43 



0.74 
4.9? 
0.15 



2.23 
6.08 
0.37 



-7.50 
6 08 
-1.23 



-1.57 
4.47 
-0.35 



-3.69 
4.47 
-0.83 



-1.39 
4.59 
-0.30 



0.89 
4.59 
0.19 



-1.81 
4 37 
-C .41 



-2.73 
4 37 
-0.62 



1.75 
6.30 
0.28 



0.00 
6.30 
0.00 



-4.87 
6.56 
-0.74 



1.04 
6.56 
0.16 



-5.85 
5.64 
-1.04 



2.04 
5 64 
0.36 
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Table 15 (continued) 

Differences between Sample Conversions and Total-Population Conversions, 

Standard Errors of Equating, and 
Standardized Differences (t-values) at Selected Raw Scores on Form D3 

GRE Quantitative Scores; Linear Equating 

Standard Errors Adjusted for the First Four Moments 

Form D3 Raw Scores 



Subpopulation Sample 10 20 30 40 50 



61 



Physical Science 
Sample 1 

Difference -7.85 -6.87 -5.90 -4.92 -3.94 

SE Equating (Ad j) 23.70 18.03 12.47 7.29 4.25 

t-value -0.33 -0.38 -0.47 -0.67 -0.93 

Sample 2 

Difference -60.39 -46.58 -32.77 -18.96 -5.15 

SE Equating (Ad j) 23.70 18 03 12 47 7 29 4.25 

t-value -2.55* -2.58** -2.63** -2.60** -1.21 

Sample 3 

Difference -41.10 -31.67 -22.24 -12.82 -3.39 

SE Equating (Ad j) 23.70 18.03 12.47 7.29 4.25 

t-value -1.73 -1.76 -1.78 -1.76 -0.80 

Sample 4 

Difference -25.50 -20.15 -14.80 -9.46 -4.11 

SE Equating 'Adj) 23.70 18.03 12.47 7.29 4.25 

t-value -1.08 -1.12 -1.19 -1.30 -0.97 

High-Scoring 
Sample 1 

Difference 23.56 17.94 12.33 6.71 1.09 

SE Equating (Adj) 18.97 14.45 10.07 6.14 4.20 

t-value 1.24 1.24 1.22 1.09 0.26 

Sample 2 

Difference 7.39 4.39 1.40 -1.60 -4.60 

SE Equating (Adj) 18.97 14.45 10.07 6.14 4.20 

t-value 0.39 0.30 0.14 -0.26 -1.09 
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Table 15 (continued) 

Differences between Sample Conversions and Total-Population Conversions, 

Standard Errors of Equating, and 
.standardized Differences (t-values) at Selected Raw Scores on Form D3 

GRE Quantitative Scores; Linear Equating 

Standard Errors Adjusted for the First Four Moments 



Form D3 Raw Scores 



Subpopulatlon Sample 

Middle-Scoring 
Sample 1 
Difference 
SE Equating (Adj) 
t-value 

Sample 2 
Difference 
SE Equating (Adj) 
t-value 

Low-Scoring 
Sample 1 
Difference 
SE Equating (Adj) 
t-value 

Sample 2 
Difference 
SE Equating (Adj) 
t-value 



10 



20 



30 



AO 



50 



-7.57 
14.09 
-0.54 



1.28 
14.09 
0.09 



-7.81 
8.76 
-0.89 



-4.41 
8.76 
-0.50 



-4.76 
9.99 
-0.48 



-1.99 
9.99 
-0.20 



-3.77 
5.63 
-0.67 



0.32 
5.63 
0.06 



-1.94 
6.32 
-0.31 



-5.26 
6,32 
-0.83 



0.27 
4.^1 

0.06 



5.05 
4.41 
1.14 



0.87 
4.37 
0.20 



-8.52 
4.37 
-1.95 



4.31 
6.33 
0.68 



9.78 
6.33 
1.54 



3.68 
6.09 
0.60 



-11.79 
6.09 
-1.93 



8.34 
9.67 
0.86 



14.50 
9.67 
1.50 



*p < 0.05 
**p < 0.01 

Note: Negative values In this table Indicate that the sample conversion 
Is lower than the total-population conversion. 
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Table 16 

Differences between Sample Conversions and Total-Population Conversions, 

Standard Errors of Equating, and 
Standardized Differences (t-values) at Selected Raw Scores on Form D3 

GRE Quantitative Scores; Equipercentile Equating 

Standard Errors Adjusted for Mean and Standard Deviation Only 



Form D3 Raw Scores 



Subpopulation Sample 10 20 30 40 50 



Male 

Sample 1 

Differences 10.45 -23.62 5.02 -2.65 -7.53 

SE Equating (Adj) 21.95 15.48 9.22 4.25 6 22 

t-value 0.48 -1.53 0.54 -0.62 -1.21 

Sample 2 

Differences 40.49 -1.94 -6.65 -4.58 -2.26 

SE Equating (Adj) 21.95 15.48 9.22 4.25 6.22 

t-value 1. 84 -0.13 -0. 72 -1 . 08 -0. 36 

Female 
Sample 1 

Differences 14.81 -2.47 5.20 -1.49 3.78 

SE Equating (Adj) 19.02 12.28 6.09 4.48 10. QO 

t-value 0.78 -0.20 0.85 -0.33 0.38 

Sample 2 

Differences -25.85 -14.17 -8.16 -9.38 -10.78 

SE Equating (Adj) 19.02 12.28 6 09 4.48 10 00 

t-value -i.36 -1.15 -1.34 -2.09* -1.08 

White 
Sample 1 

Difference a 0.44 0.91 -6.94 -12.74 

S£ Equating (Adj) - 14.27 7.87 3.87 7.94 

t-/alue - 0.03 0.12 -1.79 -1.61 

Minority 
Sample 1 

Difference 4.41 -3.13 -1.59 -9.82 -1.71 

SE Equating (Adj) 12.93 7.13 3.89 7.74 13.61 

t-value 0. 34 -0.44 -0.41 -1.27 -0.13 
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Table 16 (continued) 

Differences between Sample Conversions and Total-Population Conversions, 

Standard Errors of Equating, and 
Standardized Differences (t-values) at Selected Raw Scores on Form D3 

GRE Quantitative Scores; Equipercentile Equating 

Standard Errors Adjusted for Mean and Standard Deviation Only 



Form D3 Raw Scores 



Subpopulation Sample 

Humanities 
Sample 1 
Difference 
SE Equating (Adj) 
t-value 

Sample 2 
Difference 
SE Equating (Adj) 
t-value 

Social Science 
Sample 1 
Difference 
SE Equating (Adj) 
t-value 

Sample 2 
Difference 
SE Equating (Adj) 
t-value 

Biological Science 
Sample 1 
Difference 
SE Equating (Adj) 
t-value 

Sample 2 
Difference 
SE Equating (Adj) 
t-value 



10 



20 



30 



40 



50 



11.73 
18.60 
0.63 



-0.19 
13.02 
-0.01 



-5.56 
13.02 
-0.43 



7.23 
11.91 
0.61 



-5.77 
11.91 
-0.48 



22.92 
15.73 
1.46 



-25.12 
15.73 
-1.60 



-2.53 
6.54 
-0.39 



-4.65 
6.54 
-0.71 



5.52 
5.81 
0.95 



-1.29 
5.81 
-0.22 



-7.02 
8.52 
-0.82 



-5.84 
8.52 
-0.69 



-4. 19 
4.31 
-0.97 



-4.05 
4 31 
-0.94 



-5.47 
4.67 
-1.17 



3.92 
4.67 
0.84 



-1.04 
3.87 
-0.27 



-3.60 
3.87 
-0.93 



9.24 
9.84 
0.94 



-0.32 
9.84 
-0.03 



-4.01 
10 19 
-0.39 



-4.11 
10.29 
-0.40 



-4.80 
8.63 
-0.56 



0.46 
8.63 
0.05 
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Table 16 (continued) 

Differences between Sample Conversions and Total-Population Conversions, 

Standard Errors of Equating, and 
Standardized Differences (t-values) at Selected Raw Scores on Form D3 

GRE Quantitative Scores; Equipercentile Equating 

Standard Errors Adjusted for Mean and Standard Deviation Only 



Form D3 Raw Scores 



Sub population Sample 

Physical Science 
Sample 1 
Difference 
SE Equating (Adj) 
t-value 

Sample 2 
Difference 
SE Equating (Adj; 
t-value 

Sample 3 
Difference 
SE Equating (Adj) 
t-value 

Sample 4 
Difference 
SE Equating (Adj) 
t-value 

High-Scoring 
Sample 1 
Difference 
SE Equating (Adj) 
t-value 

Sample 2 
Difference 
SE Equating (Adj) 
t-value 



10 



20 



30 



40 



50 



20.83 
25. 71 
0.81 



-45.06 
25.71 
-1.75 



■28.86 
25. 71 
-1.12 



-2.17 
17.03 
-0.13 



-36.49 
17.03 
-2. 14* 



-26.99 
17.03 
-1.58 



-51.50 -24.98 
25.71 17 03 
-2.00* -1.47 



19.77 
16.23 
1.22 



-10.18 
8 67 
-1.17 



-21 . 86 
8.67 
-2.52* 



-15.43 
8 67 
-1.78 



-7.51 
8.67 
-0.87 



5.24 
16.23 
0.32 



8. 17 
7 67 
1.07 



0.25 
7.67 
0.03 



-2.25 
4.02 
-0.56 



-2.48 
4.02 
-0.62 



-1.38 
4.02 
-0.34 



-2.72 
4.02 
-0.68 



2.79 
4.61 
0.60 



-7.33 
4.61 
-1.59 
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Table 16 (continued) 

Differences between Sample Conversions and Total-Population Conversions, 

Standard Errors of Equating, and 
Standardized Differences (t-values) at Selected Raw Scores on Form D3 

GRE Quantitative Scores; Equipercentile Equating 

Standard Errors Adjusted for Mean and Standard Deviation Only 

Form D3 Raw Scores 



Subpopulation Sample 10 20 30 40 50 



Middle-Scoring 
Sample 1 

Difference a -28.72 -9. 94 2.94 3.38 

SE Equating (Adj) - 17.36 9.15 3.89 9.78 

t-value - -1.65 -1.09 0.76 0.35 

Sample 2 

Difference a 0.25 -12. 17 -8.04 -15.26 

SE Equating (Adj) - 17.36 9.15 3.89 9.78 

t-value - 0.01 -1.33 -2.07* -1.56 

Low-Scoring 
Sample 1 

Difference 11.10 1.12 -2.84 3. 10 8.02 

SE Equating (Adj) 16.42 8.93 3.87 8.71 16.19 

t-value 0.68 0.13 -0.73 0.36 0.50 

Sample 2 

Difference 16.57 6.87 3.19 8.79 3.09 

SE Equating (Adj) 16.43 8.93 3.87 8.71 16.19 

t-value 1.01 0.77 0.82 1.01 0.19 



No data are available to establish conversions at these points. 

*p < 0.05 
**p < 0-01 

Note: Negative values in this table indicate that the sample conversion is 
lower than the total-population conversion. 
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Table 17 

Differences between Sample Conversions and Total-Population Conversions, 

Standard Errors of Equating, and 
Standardized Differences (t-values) at Selected Raw Scores on Form D3 

GRE Verbal-Plus-Quantitative Scores; Linear Equating 

Standard Errors Adjusted for the First Four Moments 



Form D3 Raw Scores 



Sub population Sample 

Male 

Sample 1 
Difference 
SE Equating (Adj) 
t- value 

Sample 2 
Difference 
SE Equating (Adj) 
t-value 

Female 
Sample 1 
Difference 
SE Equating (Adj) 
t-value 

Sample 2 
Difference 
SE Equating (Adj) 
t-value 

White 
Sample 1 
Difference 
SE Equating (Adj) 
t-value 

Minority 
Sample I 
Difference 
SE Equating (Adj) 
t-value 



25 



50 



75 



100 



125 



1.75 
12.45 
0.14 



-10.08 
12.45 
-0.81 



1.98 
10.84 
0.18 



-15.78 
10.84 
-1.46 



3.96 
12 47 
0.32 



3.80 
6 59 
0.58 



-2.65 
8.66 
-0.31 



-9.29 
8.66 
-1.07 



3.91 
7.16 
0.55 



-11.42 
7.16 
-1.59 



-1.42 
8.45 
-0.17 



-1.71 
4.33 
-0.40 



-7.06 
5.35 
-1.32 



-8.50 
5.35 
-1.59 



-7 



-1 



84 
43 

,32 



.06 
43 
59 



-6.80 
5.04 
-1.35 



-7.23 
4.70 
-1.54 



-11.46 
3.98 
-2 . 88** 



-7.71 
3.98 
-1.94 



7. 77 
4.73 
1.64 



-2. 70 
4 73 
-0.57 



-12.18 
4.20 
-2. 90** 



-12.74 
7.31 
-1.74 



-15.86 
6.06 
-2 . 62** 



-6.92 
6 06 
-1.14 



9.70 
7.72 
1.26 



1.65 
7 72 
0.21 



-17.56 
6.94 
-2.53* 



-18.26 
10.63 
-1.72 
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Table 17 (continued) 

Differences between Sample Conversions and Total-Population Conversions, 

Standard Errors of Equating, and 
Standardized Differences (t-values) at Selected Raw Scores on Form D3 

GR£ Verbal-Plus-Quantitative Scores; Linear Equating 

Standard Errors Adjusted for the First Four Moments 

Form D3 Raw Scores 



Subpopulation Sample 25 50 75 100 125 



68 



Humanities 
Sample 1 

Difference 3.23 3.39 3.54 3.70 3.85 

SE Equating (Adj) 11.71 7.97 4.88 4.22 6 76 

t-value 0.28 0.42 0.73 O.oS 0.57 

Sample 2 

Difference 1.28 1.70 2.11 2.52 2.93 

SE Equating (Adj) 11.71 7.97 4.88 4.22 S 76 

t-value 0.11 0.21 0.43 0.60 0.43 

Social Science 
Sample 1 

Difference 7.16 4.76 2.36 -0.04 -2.44 

SE Equating (Adj) 10.37 6 84 4.33 4.84 7 80 

t-value 0.69 0.70 0.54 -0.01 -0.31 

Sample 2 

Difference -3.98 -1.21 1.55 4.31 7.07 

SE Equating (Adj) 10.37 6.84 4.33 4.84 7. 80 

t-value -0.38 -0.18 0.36 0.89 0.91 

Biological Science 
Sample 1 

Difference -0.39 -2.05 -3.72 -5.38 -7.05 

SE Equating (Adj) 12.80 8.51 4.93 4.40 7.59 

t-value -0.03 -0.24 -0.75 -1.22 -0.93 

Sample 2 

Difference -24.09 -14.98 -5.87 3.24 12.35 

SE Equating (Adj) 12.80 8 51 4.93 4.40 7.59 

t-value -1.88 -1.76 -1.19 0.74 1.63 
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Table 17 (continued) 

Differences Istween Sample Conversions and Total-Population Conversions, 

Standard Errors of Equating, and 
Standardized Differences (t-values) at Selected Raw Scores on Form D3 

GRE Verbal-Plus-Quantitative Scores; Linear Equating 

Standard Errors Adjusted for the First Four Hcnacncs 

Form D3 Raw Scoies 



Subpopulation Sample 25 50 75 100 125 



Physical Science 
Sample 1 

Difference -24.40 -20.08 -15.76 -11.44 -7.12 

SE Equating (Adj) 17 90 12.57 7.52 3.96 5.96 

t-value -1.36 -l.ftO -2.10* -2.89** -1.19 

Sample 2 

Difference -59.22 -44.06 -28.91 -1 .75 1.40 

SE Equating (Adj) 17.90 12 57 7.52 3.96 5.96 

t-value -3.31** -3.50** -3.84** -3.47** 0.24 

Sample 3 

Difference -40.63 -29.70 -18.77 -7.84 3.10 

SE Equating (Adj) 17.90 12.57 7.52 3.96 5.96 

t-value -2.27* -2.36* -2.50* -1.98* 0.52 

Sample 4 

Difference -25.48 -20.90 -16.33 -11.75 -7.18 

SE Equating (Adj) 17.90 12.57 7.52 3.96 5.96 

t-value -1.42 -1.66 -2.17* -2.97** -1.20 

High-Scoring 
Sample 1 

Difference 6.99 5.12 3.26 1.39 -0.48 

SE Equating (Adj) 21 17 5.13 9.26 4.30 5.19 

t-value 0.33 0.34 0.35 0.32 -0.09 

Sample 2 

Difference 3.99 1.14 -1.V0 -4.55 -7.39 

SE Equating (Adj) 21.17 15 13 9 26 4.30 5 19 

t-value 0.19 0.08 -0.18 -1.06 -1.42 
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Table 17 (continued) 

Differences between Sample Conversions and Total-Population Conversions, 

Standard Errors of Equating, and 
Standardized Differences (t-values) at Selected Raw Scores on Form D3 

GRE Verbal-Plus-Quantitative Scores; Linear Equating 

Standard Errors Adjusted for the First Four Moments 

Form D3 Raw Scores 



Subpopulation Sample 25 50 75 100 125 



Middle-Scoring 
Sample 1 

Difference -21.95 -12.53 -3.11 6.32 15.74 

SE Equating (Adj) 15.64 10.15 5.35 4.59 8.98 

t-value -1.40 -1.23 -0.58 1.37 1.75 

Sample 2 

Difference -35.08 -23.73 -12.38 -1.03 10.32 

SE Equating (Adj) 15.64 10.15 5.35 4.59 8.98 

t-value -2.24* -2.34* -2.31* -0.22 1.15 

Low-Scoring 
Sample 1 

Difference 7,26 4.50 1*74 -1.02 -3.78 

SE Equating (Adj) 8.79 4.99 4.63 8.18 12.69 

t-value 0.83 0.90 0.37 -0.13 -0.30 

Sample 2 

Difference 4.92 7.50 10.08 12.67 15.25 

SE Eqi iting (Adj) C.79 4.99 4.63 8.18 12.69 

t-value 0.56 1.50 2. 13* 1.55 1.20 



*p < 0.05 
**p < 0.01 

Note: Negative values in this table indicate that the sample conversion is 
lower than the total-population conversion. 
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Table 18 

Differences between Sample Conversions and Total-Population Conversions, 

Standard Errors of Equating, and 
Standardized Differences (t-values) at Selected Raw Scores on Form D3 

GRE Verbal-Plus-Quantitative Scores; Equipercentile Equating 

Standard Errors Adjusted for Mean and Standard Deviation Only 



Form D3 Raw Scores 



Subpopulation Sample 

itile 

Sample 1 
Difference 
SE Equating (Adj) 
t-value 

Sample 2 
Difference 
SE Equating (Adj) 
t-value 

Female 
Sample 1 
Difference 
SE Equating (Adj) 
t-value 

Sample 2 
Difference 
SE Equating (Adj) 
t-value 

White 
Sample 1 
Difference 
SE Equating (Adj) 
t-value 

Minority 
Sample 1 
Difference 
SE Equating (Adj) 
t-value 



25 



50 



75 



100 



U5 



23.20 
19.15 
1.21 



1.79 

12.87 
0.14 



-16.21 
14.14 
-1.15 



7.33 
14.14 
0.52 



-0.39 
11.87 
-0.03 



■12.09 
11.87 
-1.02 



4.62 
13.96 
0.33 



-1.04 
6.52 
-0.16 



-3.02 
7.11 
-0.42 



-8.91 
7 11 
-1.25 



10.00 
4.99 
2.00* 



-4.35 
4 99 
-0.87 



-8.69 
6 5C 
-1.34 



-1.65 
2.96 
-0.56 



-14.49 
2.96 
-4.89** 



-6.81 
2.96 
-2.30* 



8.62 
4.37 
1.97* 



0.07 
4.37 
0.02 



-13.25 
3.46 
-3.83** 



-18.40 
8.04 
-2.29* 



-17.33 
8.63 
-2.01* 



5.58 
8.63 
0.65 



-3.97 
11.11 
-0.36 



-2.80 
11.11 
-0.25 



-8.00 
10.19 
-0.79 



-38.50 
14.49 
-2.66** 
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Table 18 (continued) 

Differences between Sample Conversions and Total-Population Conversions, 

Standard Errors of Equating, and 
Standardized Differences (t-values) at Selected Raw Scores on Form D3 

GRE Verbal-Plus-Quantitative Scores; Equipercentile Equating 

Standard Errors Adjusted for Mean and Standard Deviation Only 

Form D3 Raw Scores 



Subpopulatxon Sample 25 50 75 100 125 



Humanities 
Sample 1 

Difference a 2.26 3.26 3.45 -13.33 

SE Equating (Adj) - 13 07 6.11 3 44 9.72 

t-value - 0.17 0.53 1.00 -1-37 

Sample 2 

Difference a 0.54 2.28 2.00 0.76 

SE Equating (Adj) - 13,07 6.11 3 44 9.72 

t-value - 0.04 0,37 0.58 0.08 

Social Science 
Sample 1 

Difference 15.31 15.06 0.74 1.27 22.50 

SE Equating (Adj) 18.40 11 31 4.68 4.52 11.11 

t-value 0.83 1.33 0.16 0.28 2.03* 

Sample 2 

Difference 1C.35 4.50 2.91 5.84 1.93 

SE Equating (Adj) 18 40 11.31 4.68 4.52 11.11 

t-value 0.89 0.40 0.62 1.29 0.17 

Biological Science 
Sample 1 

Difference a 3.27 -4.15 -5.23 -23.28 

SE Equating (Adj) - 14.24 6.33 3.89 11.32 

t-value - 0.23 -0.66 -1.34 -2.06* 

Sample 2 

Difference a -26.55 -9.84 6.49 2.90 

SE Equating (Adj) - 14.24 6.33 3.89 11.32 

t-value - -1.86 -1.56 1.67 0.?6 
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Table 18 (continued) 

Differences between Sample Conversions and Total-Population Conversions, 

Standard Errors of Equating, and 
Standardized Differences (t-values) at Selected Raw Scores on Form D3 

GRE Verbal-Plus-Quantitative Scores; Equipercentile Equating 

Standard Errors Adjusted for Mean and Standard Deviation Only 

Form D3 Raw Scores 



Subpopulation Sample 25 50 75 100 125 



Physical Science 
Sample 1 

Difference a -9.11 -20.55 -11.33 2.80 

SE Equating (Adj) - 18.79 10 37 3.16 7.76 

t-value - -0.49 -1.98* -3.58** 0.36 

Sample 2 

Difference a -23.24 -31.73 -15.52 5.43 

SE Equating (Adj) - 18.79 10.37 3.16 7.76 

t-value - -1.24 -3.06** -4.91** 0.70 

Sample 3 

Difference a -18.09 -24.81 -9.25 1.22 

SE Equating (Adj) - 18.79 10.37 3 16 7.76 

t-value - -0.96 -2.39* -2.93** 0.16 

Sample 4 

Difference a -15.78 -23.77 -12.03 0.03 

SE Equating (Adj) - 18 79 10 37 3 16 7.76 

t-value - -0.84 -2.29* -3.81** 0.00 

High-Scoring 
Sample 1 

Difference a a 8.78 1.77 0.22 

SE Equating (Adj) - - 14.11 4.00 8.68 

t-value - - 0.62 0.44 0.03 

Sample 2 

Difference a a 9.39 -6.04 -1.85 

SE Equating (Adj) - - 14.11 4.00 868 

t-value - - 0.67 -1.51 -0.21 
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Table 18 (continued) 

Differences between Sample Conversions and Total-Populatiop Conversions, 

Standard Errors of Equating, and 
Standardized Differences (t-values) at Selected Raw Scores on Form D3 

GRE Verbal-Plus-Quantitative Scores; Equipercentile Equating 

Standard Errors Adjusted for Mean and Standard deviation Only 

Form D3 Raw Scores 



Subpopulation Sample 25 50 75 100 125 



Middle-Scoring 
Sample 1 

Difference a -55.91 -1.64 7.29 -14.66 

SE Equating (Adj) - 17.37 7.36 4.55 14.18 

t-value - -3.22** -0.22 1.60 -1.03 

Sample 2 

Difference a -24.76 -11.75 -2.89 a 

SE Equating (Adj) - 17.37 7.36 4.55 

t-value - -1.43 -1.60 -0.64 

Low-Scoring 
Sample 1 

Difference 16.43 5.80 4.82 -8.08 a 

SE Equating (Adj) 17.00 8.42 2.99 10.15 

t-value 0.97 0.69 1.61 -0.80 

Sample 2 

Difference 22.17 4.89 8.83 21.90 a 

SE Equating (Adj) 17.00 8.42 2.99 10.15 

t-value 1.30 0.58 2.96** 2.16* 

Nc data are available to establish conversions at these points. 

*p < 0.05 
**p < 0.01 

Note: Negative values in this table indicate that the sample conversion 
is lower than the total-population conversion. 
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Figure 1 . Illustration of Good Agreement 
between Subpopulation Sample Conversion 
and Total Population Conversion 

GRE-Quantitative; Linear Equating 
Standard Errors Adjusted for First Four Moments 
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Figure 2. Illustration of Relatively Poor Agreement 
between Subpopulation Sample Conversion 
and Total Population Conversion 

GRE-Quantitative; Linear Equating 
Standard Errors Adjusted for First Four Moments 
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Science Sample 2 conversion and the base line conversion, also in terms of 1.96 
standard errors and 2.58 standard errors, and also where adjustments have been 
made in the moments of the distribution. 

Returning to Table 15 and its equipercertile counterpart, Table 16, the 
values of t convey a much more favorable picture of population-independency for 
for equating than was found in Tables 11 and 12, where nc adjustments were made 
in the standard error formulas for the moments of the distributions. The 
Physical Science group is especially noteworthy here because in this table 
Sample 2, which yielded the steepest slope and lowest intercept of all the 
conversions calculated in this study, was the only one that showed consis- 
tently significant t-values. Moreover, although the discrepancies between 
the sample conversions for Physical Science and the conversion for the Total- 
Population were large, they have virtually no practical significance; the 
proportions of cases affected by these discrepancies were \ ry small. For 
example, only 2.0 percent of the Physical Science subpopulatlon earned raw 
scores lower than 30 on Form Dl; 1.6 percent earned raw scores lower than 30 
on Form D3. The conclusion to be drawn from Tables 15 and 16 is that, at 
least for this homogeneously constructed test — and presumably for other homo- 
geneous te3t8 — the assumption of population-independency is unchallenged. 

A slightly different picture emerges when we examine the results for the 
heterogeneous verbal-plus-quantitative test in Tables 17 and 18. These tables 
show a few scattered significant t-values in various places throughout the 
tables, most of them without clear pattern. The exception is, again, the 
Physical Science subpopulatlon. Sample 1 shows the smallest scaled score 
differences and t-values and Sample 2, the largest. In general, the values 
are somewhat larger than those in Tables 15 and 16, which were based on the 
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homogeneous quantitative test. What Is of some Interest here Is that the large 
significant differences are found consistently only for the Physical Science 
group, not for the High-Scoring group, whose mean GRE verbal-plus quantitative 
8 core is even higher than the Physical Science group's mean. This Is 
especially puzzling In view of the overlap, noted In connection with Tables 11 
and 12, between these two high-performing subpopulatlons. It can only be 
concluded that because, as observed earlier, Forms Dl and D3 are not as closely 
parallel as would have been preferred ft a test of the assumption, the 
findings for the Physical Science samples with respect to their use In the 
equating of this heterogeneous verbal-plus-quantltatlve >t may well be a 
consequence of the lack of parallelism between the t< o forms of the test. 

Incidental findings, ancillary to the central purpose of this study, 
have led to the following conclusions: 

1. Linear equating of two parallel forms of a test, using random 
groups and no anchor t»st, yields smaller standard errors than equ* percentile 
equating. This Is true at all points along the scale of scores except In the 
vicinity of the mean, where che errors are very nearly the same. The disparity 
between the two methods of treatln, the d&ta Increases sharply with Increasing 
distances from the mean. 

2. The method of smoothing data derived from the negative hyper- 
geometrlc distribution (Keats & Lord, 1962) yielded smoothed frequencies 

that departed considerably from the observed frequencies for long runs of data. 
It was therefore rejected as a method of choice for smoothing rheso data In 
favor of a method developed by Cureton and Tukey (1951). (See also Angoff, 
1984, page 12.) The Cureton-Tukey method makes use of a weighted rolling 
average of frequencies that neighbor the frequency of Interest, with 
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8maller weights for mo^e distant frequencies. From their experience in this 
study, the authors recommend that the results of any smoothing method be 
examined closely before applying it. It is entirely lik&ly that procedures 
like the Cureton-Tukey smoothing will be adequate for most equipercentile 
equating. 

3. Samples should be chosen fo*» equating that yield as nearly as 
possible rectangular distributions of scores, that is, with approximately equal 
numbers of cases at each point along the scale. This recommendation runs 
counter to the way in which equating samples are typically drawn, samples that 
yield bell-shaped distributions in which the mean of the distribution and the 
preponderance of cases fall at about the same point, and the frequencies 
decline in number in the direction of the extremes. Because the type of 
sampling recommended here (to achieve rectangular distributions) will reduce 
kurtosis, it may be inferred from equations (10) and (11) that it will also 
reduce the size of the coefficient of z^(x). If, in addition, efforts are made 
to select samples that yield symmetrical distributions, the size of the 
coefficient of z(x) will also be reduced. Both of these actions will have the 
effect of enhancing the precision of equating at the extremes of the scale 
where elating errors are likely to be very large. 
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Summary and Conclusions 

In this study, two general populations of students, one taking Form Dl, 
the other taking Form 03 of the GRE General Test at the regular October 1981 
administration, were used to equate th<* two forms of the homogeneous GRE 
quantitative test and also the specially constituted heterogenous GRE verbal- 
plus-quantitative test. Conversions developed from samples drawn fiom homo- 
geneously defined subgroups of the total populatior revealed that the conver- 
sion for GRE quantitative scores, derived from the total population, applied 
reasonably well to its various component subgroups. However, the conversion 
for GRE verbal-plus-quantitative scores, derived on the basis of total- 
population data, is in some doubt; it applies well for every one of the 
subgroups except for the Physical Science group. This being the case, it can 
only be concluded that the assumption of population-independence for homo- 
geneous parallel tests is clearly supported by these results, but because the 
forms used in this study were not very closely parallel, support for the 
corresponding assumption for heterogeneous tests is less clearly established. 

It is difficult on logical grounds for the authors to accept the con- 
clusion that the conversion line for parallel heterogeneous forms is dependent 
on the nature of the population cn which the equating is based when the con- 
versions for its component homogeneous tests are independent of the nature of 
that population. It would seem that a score of 100, say, on one form of a 
heterogeneous test, however that score was reached — by scoring 80 on verbal 
and 20 on quantitative for example, or 20 on verbal and 80 on quantitative — 
would have to convert to the same score on a parallel form of that hetero- 
geneous test, whether the members of the population on which the equating is 
based are verbally inclined or whether they are quantitatively inclined. The 
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key assumptions in coming to this conclusion are (1) that the two heterogeneous 
forms contain homogeneous component subtests that are parallel, at least 
approximately so, and (2) that the group taking one form and the group taking 
the other form are randomly drawn from the same population* It would seem to the 
authors then that, unlike the "equating- of nonparallel tests and unlike the use 
of nonrandom groups for equating, there would be no opportunity in properly 
designed and executed equating studies for different patterns of responre ma^e by 
different populations to affect the equating adversely. It would follow that the 
results of equating, even the equating of scores on heterogeneous, but pa*&ilel, 
forms of a test, should therefore be unique and generalizable to all populations. 

In this connection it is noted that in Lord's (1982) discussion of 
equating, he refers to Braun and Holland's (1982, page 15) definition, "Form X 
and Form Y are equated on [population] P if the distribution of the transformed 
[x] scores in population P is Lhe same as the distribution of the (untrans- 
formed) [y] scores," as "beyond reproach." For the general case, where, for 
example, Form x and Form y are not necessarily parallel, this definition is fully 
supported; and Lord provides an example of two tests of sharply different 
difficulty for whom an identity transformation, clearly misrepresenting the 
difference in difficulty between the fornu>, will result if the population used 
for equating guesses at random v> almost all the items of the two forms. The 
question addressed by this study is whether the Braun-Holland definition is not 
too .eneral, whether whe equating of two parallel forms — again, even forms 
that are cnly approximately parallel and reasonably appropriate for the samples 
used to equate theta — is not, allowing for random variation, sufficiently unique 
and general and independent, for most purpc es, of the type of population used 
for equating. The results of this study indicate that at least for homogeneous 
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parallel tests an equating function, properly developed, is in fact population- 
independent. As indicated elsewhere in this report, the assumption of 
population-independence is less clearly established here for heterogeneous 
tests, very likely because of the lack of complete parallelism of the two 
heterogeneous forms used in this study* 
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Table I 

Conversion Parameters and Conversions 
from Selected Form D3 Raw Scores to the Form Dl Reference Scale > 
Based on the Total Population, Subpopulations , and Subpopulation Samples 

GRE Quantitative Scores; Linear Equating 



Converted Scores 
Corresponding to 



Group 


Parameters 




Form D3 Raw 


Score 




Slope 


Interceot 


10 

X VS 


£. Vs 








Total Population 


10.2726 


103.3360 


206 


309 


412 


514 


617 


Male 
















Total Subpop'n 


10.2681 


101.0457 


204 


306 


409 


512 


614 


Sample 1 


10.1123 


106.0891 


207 


8 


409 


511 


612 


Sample 2 


10. 181? 

X VS . 1UJ£ 


101 97&0 




jUD 




CAQ 




Female 
















Total Subpop'n 


10.3356 


101.6045 


205 


308 


412 


515 


618 


Sample 1 


10.C937 


111.9021 


213 


314 


415 


516 


617 


Sample 2 


10.4814 


85.7127 


19) 

X 7 1 




/inn 




a i n 


White 
















Total Subpop'n 


10.1384 


109 5728 


211 


312 


414 


515 


616 


Sample 1 


9.6533 


121.7260 


218 


315 


411 


508 


604 


Minority 
















Total "ubpop'n 


10.5720 


89.1560 


195 


301 


406 


512 


618 


Sample 1 


10.5695 


90.5590 


196 


302 


408 


513 


619 


Humanities 
















Total Subpop'n 


10.8483 


7", 0707 


187 


295 


404 


512 


620 


Sample 1 


10.6053 


88.453; 


195 


301 


407 


513 


619 


Sample 2 


U.6419 


84.8739 


191 


298 


404 


511 


617 


Social Science 
















Total Subpop'n 


10.2348 


106.7015 


209 


111 


414 


516 


618 


Sample 1 


9.9245 


115.8739 


215 


:ii4 


414 


51? 


612 


sample 2 


10.2879 


103.6146 


206 


309 


412 


515 


618 


Biological Science 
















Total Subpop'n 


10.3010 


99.8530 


203 


306 


409 


5)2 


615 


Sample 


9.8687 


117.6855 


216 


3i.-> 


414 


512 


611 


Sample J 


10.7496 


81.524'j 


189 


297 


404 


512 


619 
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Table I (continued) 

Conversion Parameters and Conversions 
from Selected Form D3 Raw Scores to the Form Dl Reference Scale, 
Based on the Total Population, Subpopulations, and Subpopulation Samples 

GRE Quantitative Scores; Linear Equating 



Parameters 



Converts 1 Scores 
Corresponding to 
Form D3 Raw Score 



Group 


Slope 


Intercept 


10 


20 


30 


40 


50 


Physical Science 

















Total Subpop'n 


10.9418 


64.1202 


174 


283 


392 


502 


611 


Sample 1 


10.3703 


94.5100 


198 


302 


406 


509 


613 


Sample 2 


11.6538 


29.1306 


146 


262 


379 


495 


612 


Sample 3 


11.2153 


52- 8106 


165 


277 


389 


501 


614 


Sample 4 


10.8073 


72.4920 


181 


289 


397 


505 


613 


High-Scoring 
















Total Subpop'n 


9.9475 


120.0987 


220 


319 


419 


518 


617 


Sample 1 


9.7110 


132.5108 


230 


327 


424 


521 


618 


Sample 2 


9.9730 


113.7201 


213 


313 


413 


513 


612 


Middle-Scoring 
















Total Subpop'n 


10.1121 


107.7814 


209 


310 


411 


512 


613 


Sample 1 


10.5538 


92.9565 


198 


304 


410 


515 


621 


Sample 2 


9.9459 


107.8810 


207 


30/ 


406 


506 


605 


Low-Scoring 
















Total Subpop'n 


10.7956 


90.2983 


198 


306 


413 


521 


629 


Sample 1 


10.6765 


91.4856 


198 


305 


412 


519 


625 


Sample 2 


10.7455 


94.1957 


202 


309 


417 


524 


631 
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Table II 

Conversions from Selecte Form D3 Raw Scores 
to the Form Dl Reference Scale, 
Based on the Total Population, Subpopulations , and Subpopulation Sampl 

GRE Quantitative Scores; Equipercentile Equating 



Converted Scores 
Corresponding to 
Form D3 Raw Score 



Group 


10 


20 


30 


40 


50 


Total Population 


189 


300 


416 


514 


615 


Male 












Total Subpop'n 


159 


297 


415 


51" 


612 


Sample 1 


200 


277 


421 


51i 


608 


Sample 2 


230 


298 


410 


509 


613 


Female 












Total Subpop'n 


193 


301 


416 


514 


618 


Sample 1 


204 


298 


421 


512 


619 


Sample 2 


163 


236 


408 


504 


605 


wui ue 












lotai buDpop n 


166 


304 


417 


515 


615 


Sample 1 


a 


301 


417 


507 


603 


Minority 












Total Subpop'n 


193 


295 


413 


503 


615 


Sample 1 


194 


297 


415 


504 


614 


Humanities 












Total Subpop'n 


a 


282 


410 


511 


619 


Sample 1 


a 


300 


414 


510 


625 


Sample 2 


a 


295 


412 


510 


615 


Social Science 












Total Subpop'n 


200 


302 


418 


516 


616 


Sample 1 


a 


308 


422 


508 


611 


Sample 2 


201 


295 


415 


518 


611 


Biological Sci'uce 












Total Subpop'n 


a 


302 


412 


512 


614 


Sample 1 


a 


323 


409 


5)3 


610 


Sample 2 


a 


275 


410 


510 


616 
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Table II (continued) 



Conversions from Selected Form D3 Raw Scores 
to tne Form Dl Reference Scale, 
Based on the Total Population, Subpopulations, and Subpopulation Samples 

GRE Quantitative Scores; Equipercentile Equating 



Converted Scores 
Corresponding to 







Form D3 Raw Score 


Group 


10 


20 


30 


40 


50 


Physical Science 












Total Subpop'n 


a 


293 


400 


498 


610 


Sample 1 


a 


321 


414 


504 


613 


Sample 2 


a 


255 


380 


492 


613 


Sample 3 


a 


271 


389 


498 


614 


Sample 4 


a 


249 


391 


506 


613 



High-Scoring 



Total Subpop'n 


a 


a 


425 


518 


617 


Sample 1 


a 


a 


436 


522 


618 


Sample 2 


a 


a 


421 


514 


608 


Middle-Scoring 












Total Subpop'n 


a 


297 


413 


512 


612 


Sample 1 


a 


272 


406 


517 


619 


Sample 2 


a 


3ul 


404 


506 


600 


Low -Scoring 












Total Subpop'n 


189 


301 


418 


514 


621 


Sample 1 


200 


301 


413 


517 


623 


Sample 2 


206 


307 


419 


523 


618 



No data are available at these points to establish 
conversions. 



ERJC 



87 



-82- 



Table III 

Conversion Parameters and Conversions 
from Selected Form D3 Raw Scores to the Form Dl Refereuce Scale, 
Based on the Total Population, Subpopulations, and Subpopulation Samples 

GRE Verbal-Plus-Quantitative Scores; Linear Equating 



Parameters 



Converted Scores 
Corresponding to 
Form D3 Raw 3core 



Group 


Slope 


Intercept 


25 


50 


75 


100 


125 


Total Population 


5.3481 


47.1093 


181 


315 


448 


582 


716 


Male 

To tax Subpop'n 
Sample i 
Sample 2 


5.3306 
5.1720 
5.3797 


42.1521 
53.2613 
36.2442 


175 
183 
171 


309 
312 
305 


442 
441 
440 


575 
570 
5^4 


708 
700 
709 


Female 

Total Subpop'n 
Sample 1 
Sample 2 


5.4489 
5.4253 
5.5224 


43.0848 
47.1570 
26.9756 


179 
183 
165 


316 
318 
303 


452 
454 
441 


588 
590 
579 


724 
725 
717 


White 

Total Subpop'n 
Sample 1 


5.3197 
5.1329 


50.6382 
56. 4541 


184 
185 


317 
313 


450 
441 


583 
570 


716 
698 


Minority 

Total Subpop'n 
Sample 1 


5.1444 
5.1275 


53.9863 
56.4248 


183 
185 


31J 
313 


440 
441 


568 
569 


697 
697 


Humanities 

Total Subpop'n 
Sample 1 
Sample 2 


5.4335 
5- 3543 
5.3646 


42.6379 
50.1868 
47.9815 


178 
184 
182 


314 
318 
316 


450 
452 
450 


586 
586 
584 


722 
719 
719 


Social Science 
Total Subpop'n 
Sample 1 
Sample 2 


5.3773 
5.2521 
5.4586 


47.9108 
56.6652 
40.3703 


182 
188 

177 


.",17 
319 
313 


451 
451 
450 


586 
582 
586 


720 
713 
723 


Biological Science 
Total Sibpop'n 
Sample 1 
Sample 2 


5.4964 
5.2815 
5.7125 


33.U68 
48.3868 
13.9108 


171 
180 
157 


308 
312 
300 


445 
444 
442 


583 
577 
585 


720 
709 
728 
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Table III (continued) 

Conversion Parameters and Conversions 
from Selected Form D3 Raw Scores to the Form Dl Reference Scale, 
Based on the Total Population, Subpopulations , and Subpopulation Samples 

GRE Verbal-Plus-Quant itacive Scores; Linear Equating 



Converted Scores 
Corresponding to 



Group 


Parameters 




Form 


D3 Raw 


Score 




Slope 


Intercept 


25 


50 


75 


100 


125 


Physical Science 






— — 


— — 


— 







Total Subpop'n 


5.7117 


-2.2502 


141 


283 


426 


569 


712 


Sample 1 


5.5209 


18.3923 


156 


294 


432 


570 


709 


Sample 2 


5.9543 


-27.2611 


122 


270 


419 


568 


717 


Sample 3 


5.7854 


-4.4566 


140 


2*^ 


429 


574 


719 


Sample 4 


5.5311 


17.0565 


155 




432 


570 


708 


High-Scoring 
















Total Subpop'n 


5.3295 


47.9151 


181 


314 


448 


581 


714 


Sample 1 


5.2734 


55.9681 


188 


320 


451 


583 


715 


Sample 2 


5.2343 


53.9418 


185 


316 


447 


577 


708 


Middle-Scoring 
















Total Subpop'n 


5.5707 


25.7150 


165 


304 


444 


583 


722 


Sample 1 


5.7250 


15.7345 


159 


302 


445 


588 


731 


Sample 2 


5.8021 


0.6808 


146 


2S1 


436 


581 


726 


Low-Ccoring 
















Total Subpop'n 


5.5541 


38.4494 


177 


3i6 


455 


594 


733 


Sample 1 


5.2377 


57.1245 


188 


319 


450 


581 


712 


Sample 2 


5.4514 


49.4465 


186 


3?2 


458 


595 


731 
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Table XV 



Conversions from Selected Form D3 Raw Scores 
to the Form Dl Reference Scale, 
Based on the Total Population, Subpopulatlons , and Subpopulatlon Samples 

GRE Verbal-Plus-Quantltatlve Scores; Equlpercentlle equating 



Converted Scores 
Corresponding to 
Form D3 Raw Score 



Group 


25 


50 


75 


100 


125 


Total Population 


181 


314 


448 


582 


712 


Male 












Total Subpop'n 


163 


310 


442 


574 


712 


Sample 1 


a 


298 


445 


568 


695 


Sample 2 


a 


321 


439 


575 


718 


Female 












Total Subpop'n 


188 


316 


451 


589 


713 


San pie 1 


204 


313 


458 


591 


708 


Sample 2 


a 


302 


444 


582 


709 


White 












Total Subpop'n 


a 


317 


449 


583 


713 


Sample 1 


a 


318 


439 


569 


70* 


Minority 












Total Subpop'n 


183 


311 


445 


563 


673 


Sample 1 


183 


313 


446 


564 


674 


Humanities 












Total Subpop'n 


a 


304 


450 


586 


705 


Sample 1 


a 


316 


451 


586 


699 


Sample 2 


a 


314 


450 


584 


713 


Social Science 












Total Subpop'n 


191 


317 


452 


586 


714 


Sample 1 


197 


329 


449 


583 


735 


Sample 2 


198 


3i8 


451 


588 


714 



Biological Science 



Total Subpop'n 


a 


305 


444 


584 


711 


Sample 1 


a 


317 


444 


577 


689 


Sample 2 


a 


287 


438 


589 


715 
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Table IV (continued) 

Conversions from Selected Form D3 Raw Scores 
to the Form Dl Reference Scale, 
Based on the Total Population, Subpopulations , and Subpopulation Samples 

GRE Verbal-Plus-Quantitative Scores; Equipercentile equating 



Converted Scores 
Corresponding to 
Form D3 Raw Score 



Group 


25 


50 


75 


100 


125 


Physical Science 












Total Subpop'n 


170 


301 


419 


568 


713 


Sample 2 


a 


305 


427 


571 


715 


Sample 2 


a 


291 


416 


567 


718 


Sample 3 


a 


296 


423 


573 


713 


Sample 4 


a 


298 


424 


570 


712 


High-Scoring 












Total Subpop'n 


a 


a 


450 


581 


711 


Sample 1 


a 


a 


457 


584 


712 


Sample 2 


a 


a 


457 


576 


710 


Middle-Scoring 












Total Subpop'n 


a 


299 


443 


583 


734 


Sample 1 


a 


258 


446 


589 


^97 


Sample 2 


a 


289 


436 


579 


a 



Low-Scoring 



Total Subpop'n 


181 


315 


454 


589 


a 


Sample 1 


198 


320 


453 


574 


a 


Sample 2 


203 


319 


457 


604 


a 



No data are available at these ./oints to estab- 
lish conversions. 
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